2025-05-07T19:42:32.1109162Z Current runner version: '2.323.0' 2025-05-07T19:42:32.1115408Z Runner name: 'i-0735a6dcd00858cc7' 2025-05-07T19:42:32.1116456Z Machine name: 'ip-10-0-3-41' 2025-05-07T19:42:32.1119298Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:32.1121584Z Contents: read 2025-05-07T19:42:32.1122279Z Metadata: read 2025-05-07T19:42:32.1122864Z Packages: read 2025-05-07T19:42:32.1123386Z ##[endgroup] 2025-05-07T19:42:32.1125653Z Secret source: None 2025-05-07T19:42:32.1126321Z Prepare workflow directory 2025-05-07T19:42:32.1748226Z Prepare all required actions 2025-05-07T19:42:32.1785957Z Getting action download info 2025-05-07T19:42:32.3482860Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:32.5632828Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:32.9960664Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.11, 12.6.3, clang) 2025-05-07T19:42:33.0778512Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:33.0899884Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:33.0909240Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:33.0910216Z ##[endgroup] 2025-05-07T19:42:34.8609206Z Runner Type: linux.24xlarge 2025-05-07T19:42:34.8610661Z Instance Type: c5.24xlarge 2025-05-07T19:42:34.8650168Z AMI Name: unknown 2025-05-07T19:42:34.8651049Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:39.9368080Z ##[group]Checking docker version 2025-05-07T19:42:39.9380658Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:39.9593153Z '1.44' 2025-05-07T19:42:39.9609529Z Docker daemon API version: '1.44' 2025-05-07T19:42:39.9610090Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:39.9805327Z '1.44' 2025-05-07T19:42:39.9816061Z Docker client API 
version: '1.44' 2025-05-07T19:42:39.9824143Z ##[endgroup] 2025-05-07T19:42:39.9827216Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:39.9833232Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=a7f211" 2025-05-07T19:42:40.0002829Z ##[command]/usr/bin/docker network prune --force --filter "label=a7f211" 2025-05-07T19:42:40.0153649Z ##[endgroup] 2025-05-07T19:42:40.0154413Z ##[group]Create local container network 2025-05-07T19:42:40.0165414Z ##[command]/usr/bin/docker network create --label a7f211 github_network_b8a0e08264114d09b83c4f1649c20a21 2025-05-07T19:42:40.2962903Z 9f7ee9a1ad140e879780c73103284cb6e8f3e761ef5bd3b8b575da2a9b4c9abd 2025-05-07T19:42:40.2985229Z ##[endgroup] 2025-05-07T19:42:40.3016393Z ##[group]Starting job container 2025-05-07T19:42:40.3036219Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:40.4318259Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:40.4425378Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:40.4427089Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:40.4446506Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:40.4544734Z ##[command]/usr/bin/docker create --name db7e8a0d80694b6c947d879fd1fcdfb9_amazonlinux2023_6189ae --label a7f211 --workdir /__w/FBGEMM/FBGEMM --network github_network_b8a0e08264114d09b83c4f1649c20a21 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" 
amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:40.5488939Z 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T19:42:40.5511376Z ##[command]/usr/bin/docker start 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T19:42:40.9913679Z 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T19:42:40.9935234Z ##[command]/usr/bin/docker ps --all --filter id=5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:41.0085752Z 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 Up Less than a second 2025-05-07T19:42:41.0104943Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T19:42:41.0246833Z HOME=/github/home 2025-05-07T19:42:41.0247404Z GITHUB_ACTIONS=true 2025-05-07T19:42:41.0247738Z CI=true 2025-05-07T19:42:41.0248193Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:41.0264888Z ##[endgroup] 2025-05-07T19:42:41.0274433Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:41.0276535Z ##[endgroup] 2025-05-07T19:42:41.0359667Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:41.0360567Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:41.0361460Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:41.0361968Z env: 2025-05-07T19:42:41.0362340Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:41.0362705Z BUILD_ENV: build_binary 2025-05-07T19:42:41.0363073Z BUILD_TARGET: default 2025-05-07T19:42:41.0363381Z BUILD_VARIANT: cuda 2025-05-07T19:42:41.0363733Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:41.0364022Z ##[endgroup] 2025-05-07T19:42:41.9032163Z Amazon Linux 2023 repository 66 MB/s | 37 MB 00:00 
2025-05-07T19:42:48.5366804Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:49.0991419Z Dependencies resolved. 2025-05-07T19:42:49.1169766Z Nothing to do. 2025-05-07T19:42:49.1170895Z Complete! 2025-05-07T19:42:49.3674322Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:49.4311590Z Dependencies resolved. 2025-05-07T19:42:49.4537989Z ======================================================================================== 2025-05-07T19:42:49.4539182Z Package Arch Version Repository Size 2025-05-07T19:42:49.4539993Z ======================================================================================== 2025-05-07T19:42:49.4540459Z Installing: 2025-05-07T19:42:49.4540886Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:49.4541520Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:49.4542119Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:49.4542722Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:49.4543351Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:49.4543886Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:49.4544525Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:49.4545123Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:49.4545591Z Installing dependencies: 2025-05-07T19:42:49.4546074Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:49.4546675Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:49.4547399Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.4548087Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:49.4548920Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:49.4549549Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 
2025-05-07T19:42:49.4550086Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:49.4550847Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:49.4551470Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:49.4552099Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:49.4552711Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:49.4553222Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:49.4553999Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:49.4554785Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:49.4555379Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:49.4555964Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:49.4556588Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:49.4557190Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:49.4557871Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.4558549Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:49.4559258Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:49.4559856Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:49.4560492Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:49.4561080Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:49.4561640Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:49.4562261Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:49.4562821Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:49.4563466Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:49.4665670Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:49.4666222Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:49.4666826Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:49.4667629Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:49.4668502Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:49.4669188Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:49.4669790Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:49.4670374Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:49.4670990Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.4671560Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:49.4672405Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:49.4672938Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.4673509Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.4674185Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.4674734Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:49.4675298Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:49.4675912Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:49.4676495Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.4677079Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:49.4677796Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:49.4678403Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:49.4679012Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:49.4679608Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:49.4680189Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 
2025-05-07T19:42:49.4680748Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:49.4681303Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:49.4681898Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:49.4682512Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.4683102Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:49.4683655Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:49.4684219Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:49.4684799Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:49.4685412Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:49.4686028Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:49.4686608Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.4687222Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:49.4687845Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:49.4688427Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:49.4689087Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:49.4689630Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.4690218Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:49.4690797Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:49.4691366Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.4691955Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:49.4692587Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:49.4695547Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 
amazonlinux 34 k 2025-05-07T19:42:49.4696066Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:49.4696578Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:49.4697096Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:49.4697615Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:49.4698140Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:49.4698644Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.4699143Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:49.4699632Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:49.4700212Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:49.4700720Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:49.4701255Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:49.4701805Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:49.4702343Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:49.4702853Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:49.4703355Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:49.4703846Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:49.4704360Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:49.4704845Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:49.4705349Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:49.4705763Z Installing weak dependencies: 2025-05-07T19:42:49.4706181Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:49.4706748Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:49.4707295Z perl-IO-Socket-SSL noarch 
2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:49.4707853Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:49.4708386Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:49.4708909Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:49.4709237Z 2025-05-07T19:42:49.4709345Z Transaction Summary 2025-05-07T19:42:49.4709609Z ======================================================================================== 2025-05-07T19:42:49.4709923Z Install 107 Packages 2025-05-07T19:42:49.4710060Z 2025-05-07T19:42:49.4710197Z Total download size: 38 M 2025-05-07T19:42:49.4710459Z Installed size: 151 M 2025-05-07T19:42:49.4710707Z Downloading Packages: 2025-05-07T19:42:49.5733104Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.6 MB/s | 82 kB 00:00 2025-05-07T19:42:49.5837022Z (2/107): elfutils-debuginfod-client-0.188-3.amz 6.4 MB/s | 41 kB 00:00 2025-05-07T19:42:49.6111571Z (3/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 87 MB/s | 5.3 MB 00:00 2025-05-07T19:42:49.6170000Z (4/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 12 MB/s | 786 kB 00:00 2025-05-07T19:42:49.6226236Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 14 MB/s | 539 kB 00:00 2025-05-07T19:42:49.6252483Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 7.7 MB/s | 54 kB 00:00 2025-05-07T19:42:49.6426874Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 70 MB/s | 1.1 MB 00:00 2025-05-07T19:42:49.6595483Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 77 MB/s | 2.8 MB 00:00 2025-05-07T19:42:49.6816447Z (9/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 
74 MB/s | 4.7 MB 00:00 2025-05-07T19:42:49.6876689Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 25 MB/s | 1.0 MB 00:00 2025-05-07T19:42:49.6914073Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.2 MB/s | 160 kB 00:00 2025-05-07T19:42:49.6963817Z (12/107): jansson-2.14-0.amzn2023.x86_64.rpm 5.5 MB/s | 46 kB 00:00 2025-05-07T19:42:49.7067656Z (13/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 88 MB/s | 1.6 MB 00:00 2025-05-07T19:42:49.7081580Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 3.6 MB/s | 62 kB 00:00 2025-05-07T19:42:49.7116598Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 13 MB/s | 168 kB 00:00 2025-05-07T19:42:49.7156622Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 8.6 MB/s | 57 kB 00:00 2025-05-07T19:42:49.7216290Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 61 MB/s | 756 kB 00:00 2025-05-07T19:42:49.7229498Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.5 MB/s | 28 kB 00:00 2025-05-07T19:42:49.7254323Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 13 MB/s | 108 kB 00:00 2025-05-07T19:42:49.7307113Z (20/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 13 MB/s | 95 kB 00:00 2025-05-07T19:42:49.7332891Z (21/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 15 MB/s | 153 kB 00:00 2025-05-07T19:42:49.7352302Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.3 MB/s | 31 kB 00:00 2025-05-07T19:42:49.7398097Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 12 MB/s | 106 kB 00:00 2025-05-07T19:42:49.7417179Z (24/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.5 MB/s | 26 kB 00:00 2025-05-07T19:42:49.7441237Z (25/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 14 MB/s | 121 kB 00:00 2025-05-07T19:42:49.7482576Z (26/107): nano-default-editor-8.3-1.amzn2023.no 1.7 MB/s | 10 kB 00:00 2025-05-07T19:42:49.7531005Z (27/107): nano-8.3-1.amzn2023.x86_64.rpm 53 MB/s | 706 kB 00:00 2025-05-07T19:42:49.7581513Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 29 MB/s | 394 kB 00:00 2025-05-07T19:42:49.7623974Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 42 MB/s | 573 
kB 00:00 2025-05-07T19:42:49.7659507Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 22 MB/s | 256 kB 00:00 2025-05-07T19:42:49.7703944Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 37 MB/s | 454 kB 00:00 2025-05-07T19:42:49.7758168Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 53 MB/s | 708 kB 00:00 2025-05-07T19:42:49.7805363Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 38 MB/s | 542 kB 00:00 2025-05-07T19:42:49.7831455Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 7.9 MB/s | 93 kB 00:00 2025-05-07T19:42:49.7856526Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 4.8 MB/s | 41 kB 00:00 2025-05-07T19:42:49.7877246Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.2 MB/s | 22 kB 00:00 2025-05-07T19:42:49.7910480Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 24 MB/s | 179 kB 00:00 2025-05-07T19:42:49.7940002Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 3.7 MB/s | 29 kB 00:00 2025-05-07T19:42:49.7957027Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 2.8 MB/s | 22 kB 00:00 2025-05-07T19:42:49.7986909Z (40/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 5.8 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8003762Z (41/107): perl-Data-Dumper-2.174-460.amzn2023.0 5.8 MB/s | 55 kB 00:00 2025-05-07T19:42:49.8027904Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.4 MB/s | 36 kB 00:00 2025-05-07T19:42:49.8049140Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.5 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8166350Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 108 MB/s | 1.7 MB 00:00 2025-05-07T19:42:49.8188564Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 995 kB/s | 15 kB 00:00 2025-05-07T19:42:49.8200614Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.8 MB/s | 41 kB 00:00 2025-05-07T19:42:49.8227821Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.7 MB/s | 31 kB 00:00 2025-05-07T19:42:49.8259336Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 4.3 MB/s | 21 kB 00:00 2025-05-07T19:42:49.8277296Z (49/107): perl-File-Basename-2.85-477.amzn2023. 
2.6 MB/s | 18 kB 00:00 2025-05-07T19:42:49.8297334Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 3.7 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8316942Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.5 MB/s | 36 kB 00:00 2025-05-07T19:42:49.8336497Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 10 MB/s | 60 kB 00:00 2025-05-07T19:42:49.8356491Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 3.2 MB/s | 17 kB 00:00 2025-05-07T19:42:49.8376137Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:49.8397667Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 10 MB/s | 60 kB 00:00 2025-05-07T19:42:49.8417219Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.8 MB/s | 16 kB 00:00 2025-05-07T19:42:49.8434739Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.6 MB/s | 42 kB 00:00 2025-05-07T19:42:49.8457672Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 10 MB/s | 56 kB 00:00 2025-05-07T19:42:49.8496711Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 11 MB/s | 87 kB 00:00 2025-05-07T19:42:49.8514239Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 5.7 MB/s | 42 kB 00:00 2025-05-07T19:42:49.8557219Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 22 MB/s | 218 kB 00:00 2025-05-07T19:42:49.8576141Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 4.5 MB/s | 23 kB 00:00 2025-05-07T19:42:49.8595248Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.5 MB/s | 31 kB 00:00 2025-05-07T19:42:49.8636030Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 1.7 MB/s | 13 kB 00:00 2025-05-07T19:42:49.8681700Z (65/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 46 MB/s | 392 kB 00:00 2025-05-07T19:42:49.8708355Z (66/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 16 MB/s | 97 kB 00:00 2025-05-07T19:42:49.8739310Z (67/107): perl-PathTools-3.78-459.amzn2023.0.2. 15 MB/s | 85 kB 00:00 2025-05-07T19:42:49.8765710Z (68/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 
3.6 MB/s | 20 kB 00:00 2025-05-07T19:42:49.8807109Z (69/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 13 MB/s | 84 kB 00:00 2025-05-07T19:42:49.8838006Z (70/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 30 MB/s | 215 kB 00:00 2025-05-07T19:42:49.8872052Z (71/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 7.0 MB/s | 41 kB 00:00 2025-05-07T19:42:49.8899220Z (72/107): perl-Scalar-List-Utils-1.56-459.amzn2 13 MB/s | 71 kB 00:00 2025-05-07T19:42:49.8919818Z (73/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.6 MB/s | 12 kB 00:00 2025-05-07T19:42:49.8966586Z (74/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 8.4 MB/s | 55 kB 00:00 2025-05-07T19:42:49.8995053Z (75/107): perl-Storable-3.21-458.amzn2023.0.2.x 14 MB/s | 96 kB 00:00 2025-05-07T19:42:49.9013474Z (76/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 3.4 MB/s | 15 kB 00:00 2025-05-07T19:42:49.9037132Z (77/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 492 kB/s | 23 kB 00:00 2025-05-07T19:42:49.9059003Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 8.0 MB/s | 48 kB 00:00 2025-05-07T19:42:49.9076635Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 3.7 MB/s | 22 kB 00:00 2025-05-07T19:42:49.9108682Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.4 MB/s | 36 kB 00:00 2025-05-07T19:42:49.9124115Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.5 MB/s | 17 kB 00:00 2025-05-07T19:42:49.9144722Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.5 MB/s | 22 kB 00:00 2025-05-07T19:42:49.9190701Z (83/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 18 MB/s | 108 kB 00:00 2025-05-07T19:42:49.9206704Z (84/107): perl-Time-Local-1.300-5.amzn2023.0.2. 3.5 MB/s | 34 kB 00:00 2025-05-07T19:42:49.9227313Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.1 MB/s | 17 kB 00:00 2025-05-07T19:42:49.9251770Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 4.0 MB/s | 23 kB 00:00 2025-05-07T19:42:49.9294489Z (87/107): perl-interpreter-5.32.1-477.amzn2023. 
11 MB/s | 71 kB 00:00 2025-05-07T19:42:49.9312557Z (88/107): perl-if-0.60.800-477.amzn2023.0.6.noa 1.8 MB/s | 14 kB 00:00 2025-05-07T19:42:49.9331824Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 1.9 MB/s | 15 kB 00:00 2025-05-07T19:42:49.9359236Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 19 MB/s | 126 kB 00:00 2025-05-07T19:42:49.9496143Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 117 MB/s | 2.0 MB 00:00 2025-05-07T19:42:49.9514674Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.6 MB/s | 29 kB 00:00 2025-05-07T19:42:49.9531991Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.8 MB/s | 46 kB 00:00 2025-05-07T19:42:49.9556971Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.3 MB/s | 13 kB 00:00 2025-05-07T19:42:49.9592282Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 2.8 MB/s | 14 kB 00:00 2025-05-07T19:42:49.9620412Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 14 MB/s | 112 kB 00:00 2025-05-07T19:42:49.9639312Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.5 MB/s | 12 kB 00:00 2025-05-07T19:42:49.9665059Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.0 MB/s | 13 kB 00:00 2025-05-07T19:42:49.9744738Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 92 MB/s | 1.1 MB 00:00 2025-05-07T19:42:49.9831896Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 69 MB/s | 1.3 MB 00:00 2025-05-07T19:42:49.9844399Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.4 MB/s | 56 kB 00:00 2025-05-07T19:42:49.9901740Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 44 MB/s | 613 kB 00:00 2025-05-07T19:42:50.0005068Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 59 MB/s | 879 kB 00:00 2025-05-07T19:42:50.0131882Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 81 MB/s | 2.2 MB 00:00 2025-05-07T19:42:50.0163872Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 
16 MB/s | 432 kB 00:00 2025-05-07T19:42:50.0225101Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 39 MB/s | 779 kB 00:00 2025-05-07T19:42:50.0241560Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 6.3 MB/s | 42 kB 00:00 2025-05-07T19:42:50.0263689Z -------------------------------------------------------------------------------- 2025-05-07T19:42:50.0265710Z Total 67 MB/s | 38 MB 00:00 2025-05-07T19:42:51.0995978Z Running transaction check 2025-05-07T19:42:51.1463993Z Transaction check succeeded. 2025-05-07T19:42:51.1464570Z Running transaction test 2025-05-07T19:42:51.5183997Z Transaction test succeeded. 2025-05-07T19:42:51.5185405Z Running transaction 2025-05-07T19:42:52.3478137Z Preparing : 1/1 2025-05-07T19:42:52.3632387Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:52.3870491Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:52.4073610Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:52.4122118Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:52.4194444Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:52.4285661Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:52.4563625Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:52.4625441Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:52.4675782Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:52.5180923Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:52.5250802Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:52.5572632Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:52.5624617Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:52.5679723Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:52.5741301Z Installing : 
libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:52.5788354Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:52.5917393Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:52.5971202Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:52.6020629Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:52.6083307Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:52.6138940Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:52.6180832Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:52.6606258Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:52.6683076Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:52.6822808Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:52.7262993Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:52.7432377Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:52.8256934Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:52.8257614Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:52.8258217Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:52.8258550Z 2025-05-07T19:42:52.8447390Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:52.8731881Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:52.8915618Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:52.8955313Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:53.0104557Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:53.1642821Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:53.1746993Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 
2025-05-07T19:42:53.2172990Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.2231931Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.2307527Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.2359478Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:53.2430600Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:53.2473103Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:53.2505485Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:53.2552772Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:53.2626813Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:53.2673624Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:53.2758433Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:53.2960571Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:53.3026406Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:53.3065370Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:53.3099989Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:53.3145750Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:53.3191786Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:53.3236590Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:53.3315301Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:53.3357493Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:53.3393760Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:53.3431790Z Installing : 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:53.3479063Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:53.3521889Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:53.3555025Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:53.3593828Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:53.3651089Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:53.3691255Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:53.3785161Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:53.3845027Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:53.3893407Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:53.3922935Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:53.3955286Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:53.4017132Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:53.4091034Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:53.4137664Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:53.4175899Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:53.4219579Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:53.4276166Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:53.4321134Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:53.4371288Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:53.4419641Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:53.4445113Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 
2025-05-07T19:42:53.4479171Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:53.4522052Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:53.4584349Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:53.4634972Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:53.4683544Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:53.4731130Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:53.4772762Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:53.4811242Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:53.4857137Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:53.4891947Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:53.4928862Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:53.4965951Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:53.5008539Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:53.5068884Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:53.5586845Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:53.6533124Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:53.6630091Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:53.6691475Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:53.6737619Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:53.6792230Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:53.6831565Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:53.6875448Z Installing : 
perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:53.6917553Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:53.6968715Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:53.7146548Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:53.7238838Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:53.7298167Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:53.7683011Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:53.8907036Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:53.8975467Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:53.9088756Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:53.9372850Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:53.9447123Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:53.9673750Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:53.9867042Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:53.9923171Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:54.0040851Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:54.7708790Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:54.7709884Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:54.7710519Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:54.7711184Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:54.7711883Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:42:54.7712521Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:54.7713162Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:54.7713838Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:54.7714508Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:54.7715524Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:54.7716129Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:54.7716784Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:54.7717363Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:54.7717996Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:54.7718597Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:54.7719243Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:54.7719870Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:54.7720464Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:54.7721099Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:54.7721744Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:54.7722359Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:54.7723068Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:54.7723639Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:54.7724358Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:54.7724968Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:54.7725665Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:54.7726325Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:54.7726929Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:54.7727624Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:54.7728229Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:54.7729370Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:54.7730051Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:54.7730675Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:54.7731409Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:54.7731967Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:54.7732685Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:54.7733362Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:54.7734125Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:54.7734814Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:54.7735427Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:54.7736087Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:54.7736742Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:54.7737395Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:54.7738064Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:54.7738719Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:54.7739456Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:54.7740212Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:54.7740837Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:54.7741506Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:54.7742089Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:54.7742814Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:54.7743481Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:54.7744067Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:54.7744597Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:54.7745164Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:54.7745707Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:54.7746279Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:54.7746847Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:54.7747379Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:54.7747941Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:54.7748466Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:54.7749037Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:54.7749586Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:54.7750146Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:54.7750707Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:54.7751254Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:54.7751799Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:54.7752331Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:54.7752886Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:54.7753440Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:54.7754089Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:54.7754656Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:42:54.7755181Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:54.7755729Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:54.7756365Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:54.7756927Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:54.7757468Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:54.7757992Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:54.7758555Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:54.7759110Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:54.7759667Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:54.7760237Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:54.7760797Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:54.7761347Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:54.7761936Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:54.7762474Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:54.7763006Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:54.7763551Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:54.7764112Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:54.7764629Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:54.7765164Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:54.7765680Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:54.7766202Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:54.7766758Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:54.7767315Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:54.7767872Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:54.7768391Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:54.7768939Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:54.7769461Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:54.7769993Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:54.7770512Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:54.7771038Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:54.7771609Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:54.7772115Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:54.7772630Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:54.7773179Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:54.7773694Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:54.9023943Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:54.9024296Z 2025-05-07T19:42:54.9024748Z Installed: 2025-05-07T19:42:54.9025195Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:54.9025758Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9026314Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:54.9027183Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9027779Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9028291Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9028971Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9029517Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:54.9030031Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:42:54.9030555Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9031057Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:54.9031609Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:54.9032302Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:54.9032815Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:54.9033334Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9033937Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9034452Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9034979Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:54.9035520Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9036082Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9036618Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9037155Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9037724Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9038268Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9038824Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9039331Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:54.9039870Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:54.9040447Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9040967Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:54.9041491Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:54.9042015Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:54.9042607Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:54.9043157Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9043705Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9044282Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9044869Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9045462Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9046126Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9046711Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9047451Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9048097Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:54.9048661Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9049202Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9049755Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9050267Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9050810Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:54.9051369Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:54.9051879Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9052419Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9053765Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9054347Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9054902Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9055432Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9056019Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9056566Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9057133Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9057671Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:54.9058229Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:54.9058776Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9059307Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:54.9059886Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:54.9060413Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9060948Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9061478Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:54.9062013Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9062527Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:54.9063029Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9063550Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9064076Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9064614Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:54.9065144Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9065644Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9066169Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9066699Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9067226Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9067715Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9068230Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9068845Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:54.9069377Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9069909Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9070450Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9071023Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:54.9071563Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:54.9072054Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:54.9072560Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9073066Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:54.9073582Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9074495Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9075044Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9075590Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:54.9076117Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9076643Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:54.9077179Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9077767Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9078324Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9078876Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:54.9079441Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9079977Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:54.9080519Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9081026Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:54.9081574Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:54.9082148Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:54.9082653Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9083209Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9083745Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9084282Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:54.9084767Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:54.9085188Z 2025-05-07T19:42:54.9085313Z Complete! 
2025-05-07T19:42:54.9817597Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:54.9817900Z with: 2025-05-07T19:42:54.9818108Z submodules: true 2025-05-07T19:42:54.9818326Z repository: pytorch/FBGEMM 2025-05-07T19:42:54.9818758Z token: *** 2025-05-07T19:42:54.9818953Z ssh-strict: true 2025-05-07T19:42:54.9819168Z ssh-user: git 2025-05-07T19:42:54.9819379Z persist-credentials: true 2025-05-07T19:42:54.9819631Z clean: true 2025-05-07T19:42:54.9819860Z sparse-checkout-cone-mode: true 2025-05-07T19:42:54.9820119Z fetch-depth: 1 2025-05-07T19:42:54.9820335Z fetch-tags: false 2025-05-07T19:42:54.9820540Z show-progress: true 2025-05-07T19:42:54.9820768Z lfs: false 2025-05-07T19:42:54.9820973Z set-safe-directory: true 2025-05-07T19:42:54.9821405Z env: 2025-05-07T19:42:54.9821605Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:54.9821909Z BUILD_ENV: build_binary 2025-05-07T19:42:54.9822135Z BUILD_TARGET: default 2025-05-07T19:42:54.9822422Z BUILD_VARIANT: cuda 2025-05-07T19:42:54.9822694Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:54.9822926Z ##[endgroup] 2025-05-07T19:42:54.9863403Z ##[command]/usr/bin/docker exec 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:55.3111154Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:55.3112502Z ##[group]Getting Git version info 2025-05-07T19:42:55.3112856Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:55.3113394Z [command]/usr/bin/git version 2025-05-07T19:42:55.3113789Z git version 2.47.1 2025-05-07T19:42:55.3115031Z ##[endgroup] 2025-05-07T19:42:55.3119362Z Temporarily overriding HOME='/__w/_temp/0fdc4fe6-e24b-494d-927f-420090d9e2fb' before making global git config changes 2025-05-07T19:42:55.3120171Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:55.3120829Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:55.3148450Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:42:55.3165688Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:55.3181018Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:55.3183647Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:55.3201058Z HEAD 2025-05-07T19:42:55.3235233Z ##[endgroup] 2025-05-07T19:42:55.3235943Z [command]/usr/bin/git submodule status 2025-05-07T19:42:55.3577837Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:55.3656843Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:42:55.3731757Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:55.3813753Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:42:55.3883842Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:42:55.3957596Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:42:55.4018215Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:42:55.4023023Z ##[group]Cleaning the repository 2025-05-07T19:42:55.4025971Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:55.4664150Z Removing amdgpu-install_6.3.60300-1_all.deb 2025-05-07T19:42:55.4665170Z Removing collect_env.py 2025-05-07T19:42:55.4665902Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:42:55.4666990Z Removing fbgemm_gpu/bench/verify_fp16_stochastic_benchmark.hip 2025-05-07T19:42:55.4668285Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:42:55.4670000Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_cpu_template_hip.cpp 2025-05-07T19:42:55.4670700Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu_hip.cpp 2025-05-07T19:42:55.4671357Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_hip.cpp 2025-05-07T19:42:55.4672312Z Removing 
fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.hip 2025-05-07T19:42:55.4673037Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_host_template.hip 2025-05-07T19:42:55.4673955Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_kernel_template.hip 2025-05-07T19:42:55.4674908Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu_hip.cpp 2025-05-07T19:42:55.4675671Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_approx_template_hip.cpp 2025-05-07T19:42:55.4676469Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_template_hip.cpp 2025-05-07T19:42:55.4677341Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_device_kernel_template_hip.cuh 2025-05-07T19:42:55.4678279Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_grad_template.hip 2025-05-07T19:42:55.4679051Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_cpu_template_hip.cpp 2025-05-07T19:42:55.4679845Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_template_hip.cpp 2025-05-07T19:42:55.4680634Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_indice_weights_template.hip 2025-05-07T19:42:55.4681454Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_cta_template.hip 2025-05-07T19:42:55.4682242Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_warp_template.hip 2025-05-07T19:42:55.4683035Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_meta_template_hip.cpp 2025-05-07T19:42:55.4684017Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_template.hip 2025-05-07T19:42:55.4684666Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu_hip.cpp 2025-05-07T19:42:55.4685378Z Removing 
fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_nobag_small_template.hip 2025-05-07T19:42:55.4686111Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_template.hip 2025-05-07T19:42:55.4686816Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_v2_template.hip 2025-05-07T19:42:55.4687478Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_template.hip 2025-05-07T19:42:55.4688139Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host_hip.cpp 2025-05-07T19:42:55.4688803Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops_hip.cpp 2025-05-07T19:42:55.4689519Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_device_kernel_template_hip.cuh 2025-05-07T19:42:55.4690304Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_host_template_hip.cpp 2025-05-07T19:42:55.4691030Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_kernel_template.hip 2025-05-07T19:42:55.4691728Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_template.hip 2025-05-07T19:42:55.4692433Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_autograd_template_hip.cpp 2025-05-07T19:42:55.4693146Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_cpu_wrapper_template_hip.cpp 2025-05-07T19:42:55.4693873Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_hip_wrapper_template.cpp 2025-05-07T19:42:55.4694725Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu_hip.cpp 2025-05-07T19:42:55.4695294Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_hip.cpp 2025-05-07T19:42:55.4695820Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.hip 2025-05-07T19:42:55.4696298Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.hip 2025-05-07T19:42:55.4696712Z Removing fbgemm_gpu/dist/ 
2025-05-07T19:42:55.4697073Z Removing fbgemm_gpu/experimental/example/src/cutlass_sgemm_nn.hip 2025-05-07T19:42:55.4697592Z Removing fbgemm_gpu/experimental/example/src/example_nccl_hip.cpp 2025-05-07T19:42:55.4698245Z Removing fbgemm_gpu/experimental/gen_ai/src/attention/gqa_attn_splitk.hip 2025-05-07T19:42:55.4698802Z Removing fbgemm_gpu/experimental/gen_ai/src/coalesce/coalesce.hip 2025-05-07T19:42:55.4699295Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car.hip 2025-05-07T19:42:55.4699737Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car_hip.cpp 2025-05-07T19:42:55.4700281Z Removing fbgemm_gpu/experimental/gen_ai/src/gather_scatter/gather_scatter.hip 2025-05-07T19:42:55.4700827Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache.hip 2025-05-07T19:42:55.4701355Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache_hip.cpp 2025-05-07T19:42:55.4701872Z Removing fbgemm_gpu/experimental/gen_ai/src/moe/index_shuffling.hip 2025-05-07T19:42:55.4702663Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/bf16_grouped/kernels/bf16_grouped_common_hip.h 2025-05-07T19:42:55.4703566Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise/kernels/fp8_rowwise_common_hip.h 2025-05-07T19:42:55.4704509Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_batched/kernels/fp8_rowwise_batched_common_hip.h 2025-05-07T19:42:55.4705527Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common_hip.h 2025-05-07T19:42:55.4706495Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fused_moe/fused_moe_op_hip.cpp 2025-05-07T19:42:55.4707131Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cublas_utils_hip.h 2025-05-07T19:42:55.4707764Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16bf16bf16_grouped.hip 2025-05-07T19:42:55.4708441Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16.hip 2025-05-07T19:42:55.4709172Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_rowwise_batched.hip 2025-05-07T19:42:55.4709943Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_shuffled_grouped.hip 2025-05-07T19:42:55.4710859Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16.hip 2025-05-07T19:42:55.4711627Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_f.hip 2025-05-07T19:42:55.4712475Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_t.hip 2025-05-07T19:42:55.4713332Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_f.hip 2025-05-07T19:42:55.4714463Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_t.hip 2025-05-07T19:42:55.4715341Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_f.hip 2025-05-07T19:42:55.4716255Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_t.hip 2025-05-07T19:42:55.4749469Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_f.hip 2025-05-07T19:42:55.4750366Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_t.hip 2025-05-07T19:42:55.4751441Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_f.hip 2025-05-07T19:42:55.4752313Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_t.hip 2025-05-07T19:42:55.4753190Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_f.hip 
2025-05-07T19:42:55.4754197Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_t.hip 2025-05-07T19:42:55.4755087Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_f.hip 2025-05-07T19:42:55.4756177Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_t.hip 2025-05-07T19:42:55.4757057Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_f.hip 2025-05-07T19:42:55.4757937Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_t.hip 2025-05-07T19:42:55.4758832Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_f.hip 2025-05-07T19:42:55.4759713Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_t.hip 2025-05-07T19:42:55.4760573Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_f.hip 2025-05-07T19:42:55.4761552Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_t.hip 2025-05-07T19:42:55.4762429Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_f.hip 2025-05-07T19:42:55.4763288Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_t.hip 2025-05-07T19:42:55.4764172Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_f.hip 2025-05-07T19:42:55.4765044Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_t.hip 2025-05-07T19:42:55.4765881Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_common_hip.cuh 2025-05-07T19:42:55.4766849Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_manifest_hip.cuh 2025-05-07T19:42:55.4767588Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16.hip 2025-05-07T19:42:55.4768305Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_blockwise.hip 2025-05-07T19:42:55.4769125Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_cublas.hip 2025-05-07T19:42:55.4769842Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_lite.hip 2025-05-07T19:42:55.4770539Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise.hip 2025-05-07T19:42:55.4771422Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_128_128_2_1_1_t_f.hip 2025-05-07T19:42:55.4772441Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_2_1_1_f_t.hip 2025-05-07T19:42:55.4773481Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_4_4_1_f_t.hip 2025-05-07T19:42:55.4774515Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_128_128_1_1_1_f_f.hip 2025-05-07T19:42:55.4775567Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_16_128_1_1_1_f_f.hip 2025-05-07T19:42:55.4776662Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_1_1_1_f_f.hip 2025-05-07T19:42:55.4777643Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_2_1_1_f_f.hip 2025-05-07T19:42:55.4778601Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_32_128_2_1_1_f_f.hip 
2025-05-07T19:42:55.4779571Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_64_128_2_1_1_f_f.hip 2025-05-07T19:42:55.4780508Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_common_hip.cuh 2025-05-07T19:42:55.4781395Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/common_hip.cuh 2025-05-07T19:42:55.4782555Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_cluster_size_and_transpose.hip 2025-05-07T19:42:55.4783763Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_tile_size.hip 2025-05-07T19:42:55.4784810Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched.hip 2025-05-07T19:42:55.4785812Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched_impl.hip 2025-05-07T19:42:55.4786777Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/handle_transposition.hip 2025-05-07T19:42:55.4787698Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_grouped.hip 2025-05-07T19:42:55.4788422Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_tensorwise.hip 2025-05-07T19:42:55.4789129Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_rowwise.hip 2025-05-07T19:42:55.4789817Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled.hip 2025-05-07T19:42:55.4790538Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled_grouped.hip 2025-05-07T19:42:55.4791233Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16.hip 2025-05-07T19:42:55.4791879Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16_dynamic.hip 2025-05-07T19:42:55.4792664Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/include/fp8_blockwise_cutlass_helpers_hip.h 2025-05-07T19:42:55.4793470Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/mixed_dtype_utils.hip 2025-05-07T19:42:55.4794386Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16_fast_gemv.hip 2025-05-07T19:42:55.4795075Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16fp8bf16_fast_gemv.hip 2025-05-07T19:42:55.4795764Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/fp8fp8bf16_fast_gemv.hip 2025-05-07T19:42:55.4796452Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv.hip 2025-05-07T19:42:55.4797150Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv_hip.cuh 2025-05-07T19:42:55.4797832Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/utility_hip.cuh 2025-05-07T19:42:55.4798444Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.hip 2025-05-07T19:42:55.4798971Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize_hip.cpp 2025-05-07T19:42:55.4799454Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:42:55.4799833Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:42:55.4800262Z Removing fbgemm_gpu/include/fbgemm_gpu/cumem_utils_hip.h 2025-05-07T19:42:55.4800824Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_backward_template_helpers_hip.cuh 2025-05-07T19:42:55.4801460Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_split_cpu_hip.h 2025-05-07T19:42:55.4802080Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_template_helpers_hip.cuh 2025-05-07T19:42:55.4802656Z Removing 
fbgemm_gpu/include/fbgemm_gpu/layout_transform_ops_hip.cuh 2025-05-07T19:42:55.4803238Z Removing fbgemm_gpu/include/fbgemm_gpu/permute_multi_embedding_function_hip.h 2025-05-07T19:42:55.4803791Z Removing fbgemm_gpu/include/fbgemm_gpu/quantize_ops_hip.cuh 2025-05-07T19:42:55.4804279Z Removing fbgemm_gpu/include/fbgemm_gpu/sparse_ops_hip.cuh 2025-05-07T19:42:55.4804785Z Removing fbgemm_gpu/include/fbgemm_gpu/split_embeddings_utils_hip.cuh 2025-05-07T19:42:55.4805323Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/barrier_isolation_hip.cuh 2025-05-07T19:42:55.4806048Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bench_utils_hip.cuh 2025-05-07T19:42:55.4806762Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bitonic_sort_hip.cuh 2025-05-07T19:42:55.4807275Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_postfix_hip.cuh 2025-05-07T19:42:55.4807801Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_prefix_hip.cuh 2025-05-07T19:42:55.4808340Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_cache_flusher_hip.cuh 2025-05-07T19:42:55.4808886Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_properties_hip.cuh 2025-05-07T19:42:55.4809370Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/dispatch_macros_hip.h 2025-05-07T19:42:55.4809918Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/embedding_bounds_check_common_hip.cuh 2025-05-07T19:42:55.4810454Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/find_qparams_hip.cuh 2025-05-07T19:42:55.4810969Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/float_hip.cuh 2025-05-07T19:42:55.4811406Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/hip_prelude.cuh 2025-05-07T19:42:55.4811890Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/host_device_buffer_pair_hip.cuh 2025-05-07T19:42:55.4812454Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/inclusive_sum_scan_hip.cuh 2025-05-07T19:42:55.4812957Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/kernel_launcher_hip.cuh 2025-05-07T19:42:55.4813480Z Removing 
fbgemm_gpu/include/fbgemm_gpu/utils/rocm/stochastic_rounding_hip.h 2025-05-07T19:42:55.4813970Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/vec2_hip.h 2025-05-07T19:42:55.4814514Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/weight_row_hip.h 2025-05-07T19:42:55.4815000Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/shared_memory_hip.cuh 2025-05-07T19:42:55.4815488Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding_hip.cuh 2025-05-07T19:42:55.4816033Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_builder_hip.h 2025-05-07T19:42:55.4816525Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_hip.h 2025-05-07T19:42:55.4816980Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4_hip.cuh 2025-05-07T19:42:55.4817391Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4acc_hip.cuh 2025-05-07T19:42:55.4817841Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec_quant_hip.cuh 2025-05-07T19:42:55.4818275Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vecn_hip.cuh 2025-05-07T19:42:55.4818700Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/weight_row_hip.cuh 2025-05-07T19:42:55.4819207Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_hip.h 2025-05-07T19:42:55.4819775Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_wrapper_hip.h 2025-05-07T19:42:55.4820357Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.hip 2025-05-07T19:42:55.4820928Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu_hip.cpp 2025-05-07T19:42:55.4821487Z Removing fbgemm_gpu/src/histogram_binning_calibration_ops.hip 2025-05-07T19:42:55.4821934Z Removing fbgemm_gpu/src/input_combine_ops/input_combine.hip 2025-05-07T19:42:55.4822382Z Removing fbgemm_gpu/src/input_combine_ops/input_combine_cpu_hip.cpp 2025-05-07T19:42:55.4822958Z Removing fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.hip 2025-05-07T19:42:55.4823639Z Removing 
fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu_hip.cpp 2025-05-07T19:42:55.4824319Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.hip 2025-05-07T19:42:55.4824939Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.hip 2025-05-07T19:42:55.4825444Z Removing fbgemm_gpu/src/jagged_tensor_ops/common_hip.cuh 2025-05-07T19:42:55.4825904Z Removing fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.hip 2025-05-07T19:42:55.4826397Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.hip 2025-05-07T19:42:55.4827027Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.hip 2025-05-07T19:42:55.4827699Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.hip 2025-05-07T19:42:55.4828569Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.hip 2025-05-07T19:42:55.4829370Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.hip 2025-05-07T19:42:55.4829951Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.hip 2025-05-07T19:42:55.4830534Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.hip 2025-05-07T19:42:55.4831066Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.hip 2025-05-07T19:42:55.4831601Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.hip 2025-05-07T19:42:55.4832119Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.hip 2025-05-07T19:42:55.4832620Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu_hip.cpp 2025-05-07T19:42:55.4833348Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.hip 2025-05-07T19:42:55.4834020Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.hip 2025-05-07T19:42:55.4834589Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.hip 
2025-05-07T19:42:55.4835188Z Removing fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.hip 2025-05-07T19:42:55.4835743Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.hip 2025-05-07T19:42:55.4836306Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu_hip.cpp 2025-05-07T19:42:55.4836794Z Removing fbgemm_gpu/src/memory_utils/common_hip.cuh 2025-05-07T19:42:55.4837192Z Removing fbgemm_gpu/src/memory_utils/memory_utils.hip 2025-05-07T19:42:55.4837604Z Removing fbgemm_gpu/src/memory_utils/memory_utils_hip.cpp 2025-05-07T19:42:55.4838041Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops.hip 2025-05-07T19:42:55.4838487Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops_hip.cpp 2025-05-07T19:42:55.4839059Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:55.4839753Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu_hip.cpp 2025-05-07T19:42:55.4840284Z Removing fbgemm_gpu/src/metric_ops/metric_ops.hip 2025-05-07T19:42:55.4840851Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function_hip.cpp 2025-05-07T19:42:55.4841528Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.hip 2025-05-07T19:42:55.4842207Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:55.4842904Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.hip 2025-05-07T19:42:55.4843588Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:55.4844310Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.hip 2025-05-07T19:42:55.4845010Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_split_table_batched_embeddings_hip.cpp 2025-05-07T19:42:55.4845707Z Removing 
fbgemm_gpu/src/ps_split_embeddings_cache/ps_table_batched_embeddings_hip.h 2025-05-07T19:42:55.4846459Z Removing fbgemm_gpu/src/quantize_ops/common_hip.cuh 2025-05-07T19:42:55.4846853Z Removing fbgemm_gpu/src/quantize_ops/mx/common_hip.cuh 2025-05-07T19:42:55.4847237Z Removing fbgemm_gpu/src/quantize_ops/mx_common_hip.cuh 2025-05-07T19:42:55.4847658Z Removing fbgemm_gpu/src/quantize_ops/quantize_bfloat16.hip 2025-05-07T19:42:55.4848103Z Removing fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.hip 2025-05-07T19:42:55.4848565Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.hip 2025-05-07T19:42:55.4849069Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.hip 2025-05-07T19:42:55.4849506Z Removing fbgemm_gpu/src/quantize_ops/quantize_hfp8.hip 2025-05-07T19:42:55.4849912Z Removing fbgemm_gpu/src/quantize_ops/quantize_msfp.hip 2025-05-07T19:42:55.4850291Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx.hip 2025-05-07T19:42:55.4850688Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx_hip.cuh 2025-05-07T19:42:55.4851269Z Removing fbgemm_gpu/src/quantize_ops/quantize_ops_cpu_hip.cpp 2025-05-07T19:42:55.4851735Z Removing fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.hip 2025-05-07T19:42:55.4852175Z Removing fbgemm_gpu/src/sparse_ops/common_hip.cuh 2025-05-07T19:42:55.4852607Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.hip 2025-05-07T19:42:55.4853112Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum_hip.cpp 2025-05-07T19:42:55.4853568Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.hip 2025-05-07T19:42:55.4854007Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum_hip.cpp 2025-05-07T19:42:55.4854499Z Removing fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.hip 2025-05-07T19:42:55.4855072Z Removing fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.hip 2025-05-07T19:42:55.4855571Z Removing fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.hip 
2025-05-07T19:42:55.4856067Z Removing fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.hip 2025-05-07T19:42:55.4856605Z Removing fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.hip 2025-05-07T19:42:55.4857072Z Removing fbgemm_gpu/src/sparse_ops/sparse_group_index.hip 2025-05-07T19:42:55.4857490Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_add.hip 2025-05-07T19:42:55.4857910Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_select.hip 2025-05-07T19:42:55.4858340Z Removing fbgemm_gpu/src/sparse_ops/sparse_invert_permute.hip 2025-05-07T19:42:55.4858782Z Removing fbgemm_gpu/src/sparse_ops/sparse_ops_cpu_hip.cpp 2025-05-07T19:42:55.4859237Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.hip 2025-05-07T19:42:55.4859740Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.hip 2025-05-07T19:42:55.4860190Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute102.hip 2025-05-07T19:42:55.4860610Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_1d.hip 2025-05-07T19:42:55.4861029Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_2d.hip 2025-05-07T19:42:55.4861465Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.hip 2025-05-07T19:42:55.4861895Z Removing fbgemm_gpu/src/sparse_ops/sparse_range.hip 2025-05-07T19:42:55.4862305Z Removing fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.hip 2025-05-07T19:42:55.4862766Z Removing fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.hip 2025-05-07T19:42:55.4863251Z Removing fbgemm_gpu/src/sparse_ops/sparse_zipf.hip 2025-05-07T19:42:55.4863697Z Removing fbgemm_gpu/src/split_embeddings_cache/cachelib_cache_hip.cpp 2025-05-07T19:42:55.4864201Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.cuh 2025-05-07T19:42:55.4864635Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.h 2025-05-07T19:42:55.4865282Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.hip 2025-05-07T19:42:55.4865801Z Removing 
fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.hip 2025-05-07T19:42:55.4866360Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.hip 2025-05-07T19:42:55.4866940Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte_hip.cpp 2025-05-07T19:42:55.4867527Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.hip 2025-05-07T19:42:55.4868116Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices_hip.cpp 2025-05-07T19:42:55.4868658Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.hip 2025-05-07T19:42:55.4869183Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.hip 2025-05-07T19:42:55.4869719Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.hip 2025-05-07T19:42:55.4870302Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte_hip.cpp 2025-05-07T19:42:55.4870838Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache.hip 2025-05-07T19:42:55.4871307Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache_hip.cpp 2025-05-07T19:42:55.4871839Z Removing fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.hip 2025-05-07T19:42:55.4872520Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.hip 2025-05-07T19:42:55.4873146Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops_hip.cpp 2025-05-07T19:42:55.4873814Z Removing fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.hip 2025-05-07T19:42:55.4874556Z Removing fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.hip 2025-05-07T19:42:55.4875103Z Removing fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.hip 2025-05-07T19:42:55.4875658Z Removing fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_hip.cpp 2025-05-07T19:42:55.4876271Z Removing fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.hip 2025-05-07T19:42:55.4876886Z Removing 
fbgemm_gpu/src/ssd_split_embeddings_cache/embedding_rocksdb_wrapper_hip.h 2025-05-07T19:42:55.4877569Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.cpp 2025-05-07T19:42:55.4878097Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.h 2025-05-07T19:42:55.4878720Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.cpp 2025-05-07T19:42:55.4879411Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.h 2025-05-07T19:42:55.4880050Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_tensor_wrapper_cpu_hip.cpp 2025-05-07T19:42:55.4880708Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_scratch_pad_indices_queue_hip.cpp 2025-05-07T19:42:55.4881369Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_embeddings_cache_hip.hip 2025-05-07T19:42:55.4882076Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_table_batched_embeddings_hip.cpp 2025-05-07T19:42:55.4882778Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_table_batched_embeddings_hip.h 2025-05-07T19:42:55.4883294Z Removing fbgemm_gpu/src/topology_utils_hip.cpp 2025-05-07T19:42:55.4883740Z Removing fbgemm_gpu/test/tbe/utils/cpu_kernel_test_hip.cpp 2025-05-07T19:42:55.4884177Z Removing fbgemm_gpu/test/utils/kernel_launcher_test.hip 2025-05-07T19:42:55.4884636Z Removing fbgemm_gpu/test/utils/stochastic_rounding_test.hip 2025-05-07T19:42:55.4885084Z Removing fbgemm_gpu/test/utils/tensor_accessor2_test.hip 2025-05-07T19:42:55.4885569Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_test.hip 2025-05-07T19:42:55.4886140Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_with_memcheck_test.hip 2025-05-07T19:42:55.4886762Z Removing fbgemm_gpu/test/utils/tensor_accessor_test.hip 2025-05-07T19:42:55.4887221Z Removing fbgemm_gpu/test/utils/tensor_accessor_with_memcheck_test.hip 2025-05-07T19:42:55.4887659Z Removing fbgemm_gpu/test/utils/weight_row_test.hip 
2025-05-07T19:42:55.4889765Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:55.5827866Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:55.5831669Z ##[endgroup] 2025-05-07T19:42:55.5833409Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:55.5838120Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:55.5865850Z ##[endgroup] 2025-05-07T19:42:55.5866302Z ##[group]Setting up auth 2025-05-07T19:42:55.5870046Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:42:55.5896890Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:55.6235942Z Entering 'external/asmjit' 2025-05-07T19:42:55.6297804Z Entering 'external/composable_kernel' 2025-05-07T19:42:55.6356822Z Entering 'external/cpuinfo' 2025-05-07T19:42:55.6427817Z Entering 'external/cutlass' 2025-05-07T19:42:55.6500259Z Entering 'external/googletest' 2025-05-07T19:42:55.6557855Z Entering 'external/hipify_torch' 2025-05-07T19:42:55.6618711Z Entering 'external/json' 2025-05-07T19:42:55.6698709Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:55.6731581Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:55.7021488Z Entering 'external/asmjit' 2025-05-07T19:42:55.7073381Z Entering 'external/composable_kernel' 2025-05-07T19:42:55.7129994Z Entering 'external/cpuinfo' 2025-05-07T19:42:55.7182522Z Entering 'external/cutlass' 2025-05-07T19:42:55.7251849Z Entering 'external/googletest' 2025-05-07T19:42:55.7307424Z Entering 'external/hipify_torch' 
2025-05-07T19:42:55.7376083Z Entering 'external/json' 2025-05-07T19:42:55.7445264Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:55.7496489Z ##[endgroup] 2025-05-07T19:42:55.7497398Z ##[group]Fetching the repository 2025-05-07T19:42:55.7503125Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:55.9262499Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:42:55.9263217Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:55.9285821Z ##[endgroup] 2025-05-07T19:42:55.9286228Z ##[group]Determining the checkout info 2025-05-07T19:42:55.9287939Z ##[endgroup] 2025-05-07T19:42:55.9291830Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:55.9807779Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:55.9831729Z ##[group]Checking out the ref 2025-05-07T19:42:55.9834862Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:56.0854022Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:56.0854444Z any of your branches: 2025-05-07T19:42:56.0854603Z 2025-05-07T19:42:56.0854999Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:56.0855465Z 2025-05-07T19:42:56.0855672Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:56.0856081Z to do so with: 2025-05-07T19:42:56.0856211Z 2025-05-07T19:42:56.0856340Z git branch 1c9ad64 2025-05-07T19:42:56.0856558Z 2025-05-07T19:42:56.0856959Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:56.0859278Z ##[endgroup] 2025-05-07T19:42:56.0859717Z ##[group]Setting 
up auth for fetching submodules 2025-05-07T19:42:56.0864730Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:56.0922378Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:56.0946636Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:56.0971772Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:56.0995724Z ##[endgroup] 2025-05-07T19:42:56.0996123Z ##[group]Fetching submodules 2025-05-07T19:42:56.0996417Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:56.1369653Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:56.1371031Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:56.1372100Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:56.1372508Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:56.1372909Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:56.1373444Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:56.1373861Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:56.1381379Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:42:56.2157447Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:42:56.4801009Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:42:56.5743092Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:42:57.2359807Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:42:57.2751311Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 
2025-05-07T19:42:57.2827407Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:42:57.3902576Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:42:57.3911529Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:42:57.4199272Z Entering 'external/asmjit' 2025-05-07T19:42:57.4234043Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.4264962Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.4299206Z Entering 'external/cutlass' 2025-05-07T19:42:57.4330156Z Entering 'external/googletest' 2025-05-07T19:42:57.4356788Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.4385463Z Entering 'external/json' 2025-05-07T19:42:57.4434337Z ##[endgroup] 2025-05-07T19:42:57.4435592Z ##[group]Persisting credentials for submodules 2025-05-07T19:42:57.4438570Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:42:57.4758655Z Entering 'external/asmjit' 2025-05-07T19:42:57.4796214Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4796605Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4831385Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.4874191Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4875234Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4917448Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.4958777Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4959664Z url.https://github.com/.insteadof 2025-05-07T19:42:57.4997715Z Entering 'external/cutlass' 2025-05-07T19:42:57.5033514Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5034040Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5077124Z Entering 'external/googletest' 2025-05-07T19:42:57.5116159Z url.https://github.com/.insteadof 
2025-05-07T19:42:57.5116529Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5155577Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.5195302Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5195696Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5230596Z Entering 'external/json' 2025-05-07T19:42:57.5271915Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5272921Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5330169Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:42:57.5600663Z Entering 'external/asmjit' 2025-05-07T19:42:57.5662634Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:42:57.5665229Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.5717015Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:42:57.5718596Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.5778757Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:42:57.5780270Z Entering 'external/cutlass' 2025-05-07T19:42:57.5830609Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:42:57.5831151Z Entering 'external/googletest' 2025-05-07T19:42:57.5888762Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:42:57.5889885Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.5941133Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:42:57.5942698Z Entering 'external/json' 2025-05-07T19:42:57.5988087Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:42:57.6053027Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 
'git@github.com:' 2025-05-07T19:42:57.6323774Z Entering 'external/asmjit' 2025-05-07T19:42:57.6365019Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.6392851Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6420351Z Entering 'external/cutlass' 2025-05-07T19:42:57.6457214Z Entering 'external/googletest' 2025-05-07T19:42:57.6490819Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.6517429Z Entering 'external/json' 2025-05-07T19:42:57.6571503Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:42:57.6846655Z Entering 'external/asmjit' 2025-05-07T19:42:57.6883587Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.6915036Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6940067Z Entering 'external/cutlass' 2025-05-07T19:42:57.6963854Z Entering 'external/googletest' 2025-05-07T19:42:57.6990126Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.7029823Z Entering 'external/json' 2025-05-07T19:42:57.7071732Z ##[endgroup] 2025-05-07T19:42:57.7097614Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:42:57.7116887Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:57.7256693Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:42:57.7257121Z . 
$PRELUDE; print_system_info 2025-05-07T19:42:57.7257645Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:57.7258034Z env: 2025-05-07T19:42:57.7258301Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:57.7258619Z BUILD_ENV: build_binary 2025-05-07T19:42:57.7258910Z BUILD_TARGET: default 2025-05-07T19:42:57.7259148Z BUILD_VARIANT: cuda 2025-05-07T19:42:57.7259430Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:57.7259689Z ##[endgroup] 2025-05-07T19:42:58.1595605Z ################################################################################ 2025-05-07T19:42:58.1596015Z # Print System Info 2025-05-07T19:42:58.1596292Z # 2025-05-07T19:42:58.1612734Z # [2025-05-07T19:42:58.160Z] + print_system_info 2025-05-07T19:42:58.1613814Z ################################################################################ 2025-05-07T19:42:58.1614126Z 2025-05-07T19:42:58.1614370Z ################################################################################ 2025-05-07T19:42:58.1614737Z [INFO] Printing environment variables ... 
2025-05-07T19:42:58.1615088Z + printenv 2025-05-07T19:42:58.1615239Z 2025-05-07T19:42:58.1625247Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:42:58.1625563Z BUILD_VARIANT=cuda 2025-05-07T19:42:58.1626055Z HOSTNAME=5cbac523ab1b 2025-05-07T19:42:58.1626532Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_c29d0ee9-9bb4-483f-8f2b-5cfda6c315c9 2025-05-07T19:42:58.1627027Z GITHUB_ACTION=__run_2 2025-05-07T19:42:58.1627307Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:42:58.1627585Z RUNNER_NAME=i-0735a6dcd00858cc7 2025-05-07T19:42:58.1627913Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:42:58.1628239Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:42:58.1628758Z MACHINE_NAME_LC=x86_64 2025-05-07T19:42:58.1629022Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:42:58.1629442Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:42:58.1629813Z GITHUB_REF_TYPE=branch 2025-05-07T19:42:58.1630371Z *** 2025-05-07T19:42:58.1630625Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:42:58.1630912Z GITHUB_ACTIONS=true 2025-05-07T19:42:58.1631239Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:58.1631845Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:42:58.1632424Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:42:58.1632724Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:42:58.1633027Z RUNNER_OS=Linux 2025-05-07T19:42:58.1633300Z GITHUB_REF_PROTECTED=false 2025-05-07T19:42:58.1633568Z HOME=/github/home 2025-05-07T19:42:58.1633966Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:42:58.1634281Z RUNNER_ARCH=X64 2025-05-07T19:42:58.1634549Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:42:58.1634799Z BUILD_TARGET=default 2025-05-07T19:42:58.1635271Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_c29d0ee9-9bb4-483f-8f2b-5cfda6c315c9 2025-05-07T19:42:58.1635948Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_c29d0ee9-9bb4-483f-8f2b-5cfda6c315c9 2025-05-07T19:42:58.1636482Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:42:58.1636829Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:42:58.1641552Z GITHUB_RUN_ID=14891846252 2025-05-07T19:42:58.1642079Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_c29d0ee9-9bb4-483f-8f2b-5cfda6c315c9 2025-05-07T19:42:58.1642617Z BUILD_ENV=build_binary 2025-05-07T19:42:58.1642897Z GITHUB_ACTOR=q10 2025-05-07T19:42:58.1643133Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:42:58.1643411Z KERN_NAME_LC=linux 2025-05-07T19:42:58.1643654Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:42:58.1644003Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:42:58.1644404Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:42:58.1644727Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:42:58.1645028Z SHLVL=1 2025-05-07T19:42:58.1645279Z GITHUB_ACTOR_ID=255046 2025-05-07T19:42:58.1645545Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:42:58.1646121Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:42:58.1646525Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:42:58.1646803Z KERN_NAME=Linux 2025-05-07T19:42:58.1647082Z GITHUB_JOB=build_artifact 2025-05-07T19:42:58.1647376Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:42:58.1647707Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:42:58.1647984Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:42:58.1648303Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:42:58.1648670Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:58.1649109Z GITHUB_BASE_REF=main 2025-05-07T19:42:58.1649355Z CI=true 2025-05-07T19:42:58.1649615Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:42:58.1649928Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:42:58.1650263Z GITHUB_ACTION_REF= 2025-05-07T19:42:58.1650666Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:42:58.1651177Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_c29d0ee9-9bb4-483f-8f2b-5cfda6c315c9 2025-05-07T19:42:58.1651704Z MACHINE_NAME=x86_64 2025-05-07T19:42:58.1651950Z 
_=/usr/bin/printenv 2025-05-07T19:42:58.1652098Z 2025-05-07T19:42:58.1652259Z ################################################################################ 2025-05-07T19:42:58.1652610Z [INFO] Print ldd version ... 2025-05-07T19:42:58.1652905Z + ldd --version 2025-05-07T19:42:58.1653040Z 2025-05-07T19:42:58.1653195Z ldd (GNU libc) 2.34 2025-05-07T19:42:58.1653476Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:42:58.1653962Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:42:58.1654620Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:42:58.1655091Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:42:58.1655313Z 2025-05-07T19:42:58.1655425Z ################################################################################ 2025-05-07T19:42:58.1655762Z [INFO] Print CPU info ... 2025-05-07T19:42:58.1656007Z + nproc 2025-05-07T19:42:58.1656140Z 2025-05-07T19:42:58.1664104Z 96 2025-05-07T19:42:58.1664252Z 2025-05-07T19:42:58.1664396Z + lscpu 2025-05-07T19:42:58.1664903Z 2025-05-07T19:42:58.1934743Z Architecture: x86_64 2025-05-07T19:42:58.1935121Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:42:58.1935585Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.1935995Z Byte Order: Little Endian 2025-05-07T19:42:58.1936339Z CPU(s): 96 2025-05-07T19:42:58.1936636Z On-line CPU(s) list: 0-95 2025-05-07T19:42:58.1936976Z Vendor ID: GenuineIntel 2025-05-07T19:42:58.1937391Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.1937784Z CPU family: 6 2025-05-07T19:42:58.1938082Z Model: 85 2025-05-07T19:42:58.1938373Z Thread(s) per core: 2 2025-05-07T19:42:58.1938701Z Core(s) per socket: 24 2025-05-07T19:42:58.1938986Z Socket(s): 2 2025-05-07T19:42:58.1939280Z Stepping: 7 2025-05-07T19:42:58.1939619Z BogoMIPS: 5999.98 2025-05-07T19:42:58.1942260Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.1944813Z Hypervisor vendor: KVM 2025-05-07T19:42:58.1945261Z Virtualization type: full 2025-05-07T19:42:58.1945580Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:42:58.1945942Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:42:58.1946283Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:42:58.1946632Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:42:58.1946951Z NUMA node(s): 2 2025-05-07T19:42:58.1947234Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:42:58.1947554Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:42:58.1947979Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:42:58.1948518Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:42:58.1948979Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:42:58.1949556Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:42:58.1950112Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:42:58.1950676Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:42:58.1951253Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:42:58.1951596Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:42:58.1951949Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:42:58.1952294Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:42:58.1952823Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:42:58.1953611Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:42:58.1954491Z Vulnerability Srbds: Not affected 2025-05-07T19:42:58.1955009Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:42:58.1955252Z 2025-05-07T19:42:58.1955395Z + cat /proc/cpuinfo 2025-05-07T19:42:58.1955552Z 2025-05-07T19:42:58.1957833Z processor : 0 2025-05-07T19:42:58.1958060Z vendor_id : GenuineIntel 2025-05-07T19:42:58.1958378Z cpu family : 6 2025-05-07T19:42:58.1958595Z model : 85 2025-05-07T19:42:58.1958875Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.1959238Z stepping : 7 2025-05-07T19:42:58.1959443Z microcode : 0x5003901 2025-05-07T19:42:58.1959684Z cpu MHz : 3236.188 2025-05-07T19:42:58.1959894Z cache size : 36608 KB 2025-05-07T19:42:58.1960128Z physical id : 0 2025-05-07T19:42:58.1960339Z siblings : 48 2025-05-07T19:42:58.1960574Z core id : 0 2025-05-07T19:42:58.1960797Z cpu cores : 24 2025-05-07T19:42:58.1961013Z apicid : 0 2025-05-07T19:42:58.1961250Z initial apicid : 0 2025-05-07T19:42:58.1961474Z fpu : yes 2025-05-07T19:42:58.1961724Z fpu_exception : yes 2025-05-07T19:42:58.1961956Z cpuid level : 13 2025-05-07T19:42:58.1962207Z wp : yes 2025-05-07T19:42:58.1964522Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.1967301Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.1967922Z bogomips : 5999.98 2025-05-07T19:42:58.1968166Z clflush size : 64 2025-05-07T19:42:58.1968449Z cache_alignment : 64 2025-05-07T19:42:58.1968788Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.1969215Z power management: 2025-05-07T19:42:58.1969365Z 2025-05-07T19:42:58.1969482Z processor : 1 2025-05-07T19:42:58.1969759Z vendor_id : GenuineIntel 2025-05-07T19:42:58.1970052Z cpu family : 6 2025-05-07T19:42:58.1970279Z model : 85 2025-05-07T19:42:58.1970610Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.1970979Z stepping : 7 2025-05-07T19:42:58.1971283Z microcode : 0x5003901 2025-05-07T19:42:58.1971505Z cpu MHz : 3360.479 2025-05-07T19:42:58.1971731Z cache size : 36608 KB 2025-05-07T19:42:58.1971975Z physical id : 0 2025-05-07T19:42:58.1972191Z siblings : 48 2025-05-07T19:42:58.1972412Z core id : 1 2025-05-07T19:42:58.1972603Z cpu cores : 24 2025-05-07T19:42:58.1972816Z apicid : 2 2025-05-07T19:42:58.1973021Z initial apicid : 2 2025-05-07T19:42:58.1973251Z fpu : yes 2025-05-07T19:42:58.1973456Z fpu_exception : yes 2025-05-07T19:42:58.1973698Z cpuid level : 13 2025-05-07T19:42:58.1973917Z wp : yes 2025-05-07T19:42:58.1976197Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.1978853Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.1979441Z bogomips : 5999.98 2025-05-07T19:42:58.1980144Z clflush size : 64 2025-05-07T19:42:58.1980365Z cache_alignment : 64 2025-05-07T19:42:58.1980655Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.1980986Z power management: 2025-05-07T19:42:58.1981134Z 2025-05-07T19:42:58.1981218Z processor : 2 2025-05-07T19:42:58.1981458Z vendor_id : GenuineIntel 2025-05-07T19:42:58.1981700Z cpu family : 6 2025-05-07T19:42:58.1981922Z model : 85 2025-05-07T19:42:58.1982199Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.1982582Z stepping : 7 2025-05-07T19:42:58.1982791Z microcode : 0x5003901 2025-05-07T19:42:58.1983031Z cpu MHz : 3241.639 2025-05-07T19:42:58.1983242Z cache size : 36608 KB 2025-05-07T19:42:58.1983501Z physical id : 0 2025-05-07T19:42:58.1983708Z siblings : 48 2025-05-07T19:42:58.1983929Z core id : 2 2025-05-07T19:42:58.1984123Z cpu cores : 24 2025-05-07T19:42:58.1984352Z apicid : 4 2025-05-07T19:42:58.1984570Z initial apicid : 4 2025-05-07T19:42:58.1984776Z fpu : yes 2025-05-07T19:42:58.1984986Z fpu_exception : yes 2025-05-07T19:42:58.1985199Z cpuid level : 13 2025-05-07T19:42:58.1985416Z wp : yes 2025-05-07T19:42:58.1987704Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.1990421Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.1991020Z bogomips : 5999.98 2025-05-07T19:42:58.1991238Z clflush size : 64 2025-05-07T19:42:58.1991466Z cache_alignment : 64 2025-05-07T19:42:58.1991735Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.1992069Z power management: 2025-05-07T19:42:58.1992200Z 2025-05-07T19:42:58.1992297Z processor : 3 2025-05-07T19:42:58.1992570Z vendor_id : GenuineIntel 2025-05-07T19:42:58.1992827Z cpu family : 6 2025-05-07T19:42:58.1993027Z model : 85 2025-05-07T19:42:58.1993308Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.1993760Z stepping : 7 2025-05-07T19:42:58.1993986Z microcode : 0x5003901 2025-05-07T19:42:58.1994207Z cpu MHz : 2141.255 2025-05-07T19:42:58.1994433Z cache size : 36608 KB 2025-05-07T19:42:58.1994684Z physical id : 0 2025-05-07T19:42:58.1994915Z siblings : 48 2025-05-07T19:42:58.1995129Z core id : 3 2025-05-07T19:42:58.1995325Z cpu cores : 24 2025-05-07T19:42:58.1995539Z apicid : 6 2025-05-07T19:42:58.1995746Z initial apicid : 6 2025-05-07T19:42:58.1995968Z fpu : yes 2025-05-07T19:42:58.1996168Z fpu_exception : yes 2025-05-07T19:42:58.1996400Z cpuid level : 13 2025-05-07T19:42:58.1996624Z wp : yes 2025-05-07T19:42:58.1998887Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2001532Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2002127Z bogomips : 5999.98 2025-05-07T19:42:58.2002344Z clflush size : 64 2025-05-07T19:42:58.2002582Z cache_alignment : 64 2025-05-07T19:42:58.2002852Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2003190Z power management: 2025-05-07T19:42:58.2003322Z 2025-05-07T19:42:58.2003412Z processor : 4 2025-05-07T19:42:58.2003643Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2003887Z cpu family : 6 2025-05-07T19:42:58.2004107Z model : 85 2025-05-07T19:42:58.2004394Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2004746Z stepping : 7 2025-05-07T19:42:58.2004974Z microcode : 0x5003901 2025-05-07T19:42:58.2005198Z cpu MHz : 2999.994 2025-05-07T19:42:58.2005424Z cache size : 36608 KB 2025-05-07T19:42:58.2005641Z physical id : 0 2025-05-07T19:42:58.2005861Z siblings : 48 2025-05-07T19:42:58.2006053Z core id : 4 2025-05-07T19:42:58.2006365Z cpu cores : 24 2025-05-07T19:42:58.2006561Z apicid : 8 2025-05-07T19:42:58.2006764Z initial apicid : 8 2025-05-07T19:42:58.2006965Z fpu : yes 2025-05-07T19:42:58.2007174Z fpu_exception : yes 2025-05-07T19:42:58.2007398Z cpuid level : 13 2025-05-07T19:42:58.2007594Z wp : yes 2025-05-07T19:42:58.2009815Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2012458Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2013022Z bogomips : 5999.98 2025-05-07T19:42:58.2013249Z clflush size : 64 2025-05-07T19:42:58.2013457Z cache_alignment : 64 2025-05-07T19:42:58.2013732Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2014048Z power management: 2025-05-07T19:42:58.2014191Z 2025-05-07T19:42:58.2014273Z processor : 5 2025-05-07T19:42:58.2014478Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2014724Z cpu family : 6 2025-05-07T19:42:58.2016925Z model : 85 2025-05-07T19:42:58.2017222Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2017580Z stepping : 7 2025-05-07T19:42:58.2017782Z microcode : 0x5003901 2025-05-07T19:42:58.2018022Z cpu MHz : 2999.994 2025-05-07T19:42:58.2018229Z cache size : 36608 KB 2025-05-07T19:42:58.2018457Z physical id : 0 2025-05-07T19:42:58.2018660Z siblings : 48 2025-05-07T19:42:58.2018866Z core id : 5 2025-05-07T19:42:58.2019057Z cpu cores : 24 2025-05-07T19:42:58.2019264Z apicid : 10 2025-05-07T19:42:58.2019460Z initial apicid : 10 2025-05-07T19:42:58.2019680Z fpu : yes 2025-05-07T19:42:58.2019871Z fpu_exception : yes 2025-05-07T19:42:58.2020090Z cpuid level : 13 2025-05-07T19:42:58.2020302Z wp : yes 2025-05-07T19:42:58.2022500Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2025070Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2025896Z bogomips : 5999.98 2025-05-07T19:42:58.2026108Z clflush size : 64 2025-05-07T19:42:58.2026335Z cache_alignment : 64 2025-05-07T19:42:58.2026604Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2026944Z power management: 2025-05-07T19:42:58.2027078Z 2025-05-07T19:42:58.2027163Z processor : 6 2025-05-07T19:42:58.2027387Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2027621Z cpu family : 6 2025-05-07T19:42:58.2027835Z model : 85 2025-05-07T19:42:58.2028123Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2028698Z stepping : 7 2025-05-07T19:42:58.2028917Z microcode : 0x5003901 2025-05-07T19:42:58.2029205Z cpu MHz : 2999.994 2025-05-07T19:42:58.2029438Z cache size : 36608 KB 2025-05-07T19:42:58.2029700Z physical id : 0 2025-05-07T19:42:58.2029918Z siblings : 48 2025-05-07T19:42:58.2030114Z core id : 6 2025-05-07T19:42:58.2030319Z cpu cores : 24 2025-05-07T19:42:58.2030515Z apicid : 12 2025-05-07T19:42:58.2030731Z initial apicid : 12 2025-05-07T19:42:58.2030937Z fpu : yes 2025-05-07T19:42:58.2031145Z fpu_exception : yes 2025-05-07T19:42:58.2031377Z cpuid level : 13 2025-05-07T19:42:58.2031579Z wp : yes 2025-05-07T19:42:58.2033978Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2036762Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2037340Z bogomips : 5999.98 2025-05-07T19:42:58.2037576Z clflush size : 64 2025-05-07T19:42:58.2037788Z cache_alignment : 64 2025-05-07T19:42:58.2038069Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2038388Z power management: 2025-05-07T19:42:58.2038534Z 2025-05-07T19:42:58.2038620Z processor : 7 2025-05-07T19:42:58.2038829Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2039079Z cpu family : 6 2025-05-07T19:42:58.2039291Z model : 85 2025-05-07T19:42:58.2039559Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2040007Z stepping : 7 2025-05-07T19:42:58.2040214Z microcode : 0x5003901 2025-05-07T19:42:58.2040456Z cpu MHz : 2999.994 2025-05-07T19:42:58.2040668Z cache size : 36608 KB 2025-05-07T19:42:58.2040911Z physical id : 0 2025-05-07T19:42:58.2041121Z siblings : 48 2025-05-07T19:42:58.2041333Z core id : 7 2025-05-07T19:42:58.2041530Z cpu cores : 24 2025-05-07T19:42:58.2041748Z apicid : 14 2025-05-07T19:42:58.2041948Z initial apicid : 14 2025-05-07T19:42:58.2042176Z fpu : yes 2025-05-07T19:42:58.2042378Z fpu_exception : yes 2025-05-07T19:42:58.2042609Z cpuid level : 13 2025-05-07T19:42:58.2042831Z wp : yes 2025-05-07T19:42:58.2045096Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2047709Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2048261Z bogomips : 5999.98 2025-05-07T19:42:58.2068670Z clflush size : 64 2025-05-07T19:42:58.2069153Z cache_alignment : 64 2025-05-07T19:42:58.2069434Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2069805Z power management: 2025-05-07T19:42:58.2069940Z 2025-05-07T19:42:58.2070024Z processor : 8 2025-05-07T19:42:58.2070252Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2070487Z cpu family : 6 2025-05-07T19:42:58.2070699Z model : 85 2025-05-07T19:42:58.2070986Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2071330Z stepping : 7 2025-05-07T19:42:58.2071563Z microcode : 0x5003901 2025-05-07T19:42:58.2071784Z cpu MHz : 2999.994 2025-05-07T19:42:58.2072006Z cache size : 36608 KB 2025-05-07T19:42:58.2072221Z physical id : 0 2025-05-07T19:42:58.2072439Z siblings : 48 2025-05-07T19:42:58.2072642Z core id : 8 2025-05-07T19:42:58.2072847Z cpu cores : 24 2025-05-07T19:42:58.2073044Z apicid : 16 2025-05-07T19:42:58.2073253Z initial apicid : 16 2025-05-07T19:42:58.2073461Z fpu : yes 2025-05-07T19:42:58.2073762Z fpu_exception : yes 2025-05-07T19:42:58.2073982Z cpuid level : 13 2025-05-07T19:42:58.2074369Z wp : yes 2025-05-07T19:42:58.2076656Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2079439Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2080018Z bogomips : 5999.98 2025-05-07T19:42:58.2080250Z clflush size : 64 2025-05-07T19:42:58.2080463Z cache_alignment : 64 2025-05-07T19:42:58.2080747Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2081066Z power management: 2025-05-07T19:42:58.2081213Z 2025-05-07T19:42:58.2081296Z processor : 9 2025-05-07T19:42:58.2081506Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2081755Z cpu family : 6 2025-05-07T19:42:58.2081956Z model : 85 2025-05-07T19:42:58.2082240Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2082599Z stepping : 7 2025-05-07T19:42:58.2082807Z microcode : 0x5003901 2025-05-07T19:42:58.2083116Z cpu MHz : 3252.144 2025-05-07T19:42:58.2083329Z cache size : 36608 KB 2025-05-07T19:42:58.2083568Z physical id : 0 2025-05-07T19:42:58.2083775Z siblings : 48 2025-05-07T19:42:58.2083993Z core id : 9 2025-05-07T19:42:58.2084195Z cpu cores : 24 2025-05-07T19:42:58.2084409Z apicid : 18 2025-05-07T19:42:58.2084605Z initial apicid : 18 2025-05-07T19:42:58.2084828Z fpu : yes 2025-05-07T19:42:58.2085030Z fpu_exception : yes 2025-05-07T19:42:58.2085260Z cpuid level : 13 2025-05-07T19:42:58.2085477Z wp : yes 2025-05-07T19:42:58.2088173Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2090806Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2091409Z bogomips : 5999.98 2025-05-07T19:42:58.2091624Z clflush size : 64 2025-05-07T19:42:58.2091848Z cache_alignment : 64 2025-05-07T19:42:58.2092117Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2092450Z power management: 2025-05-07T19:42:58.2092583Z 2025-05-07T19:42:58.2092668Z processor : 10 2025-05-07T19:42:58.2092898Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2093133Z cpu family : 6 2025-05-07T19:42:58.2093345Z model : 85 2025-05-07T19:42:58.2093634Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2093986Z stepping : 7 2025-05-07T19:42:58.2094205Z microcode : 0x5003901 2025-05-07T19:42:58.2094425Z cpu MHz : 3163.288 2025-05-07T19:42:58.2094659Z cache size : 36608 KB 2025-05-07T19:42:58.2094879Z physical id : 0 2025-05-07T19:42:58.2095098Z siblings : 48 2025-05-07T19:42:58.2095298Z core id : 10 2025-05-07T19:42:58.2095513Z cpu cores : 24 2025-05-07T19:42:58.2095715Z apicid : 20 2025-05-07T19:42:58.2095936Z initial apicid : 20 2025-05-07T19:42:58.2096145Z fpu : yes 2025-05-07T19:42:58.2096353Z fpu_exception : yes 2025-05-07T19:42:58.2096571Z cpuid level : 13 2025-05-07T19:42:58.2096788Z wp : yes 2025-05-07T19:42:58.2099069Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2101714Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2102356Z bogomips : 5999.98 2025-05-07T19:42:58.2102580Z clflush size : 64 2025-05-07T19:42:58.2102793Z cache_alignment : 64 2025-05-07T19:42:58.2103079Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2103395Z power management: 2025-05-07T19:42:58.2103542Z 2025-05-07T19:42:58.2103631Z processor : 11 2025-05-07T19:42:58.2103845Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2104098Z cpu family : 6 2025-05-07T19:42:58.2104303Z model : 85 2025-05-07T19:42:58.2104588Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2104955Z stepping : 7 2025-05-07T19:42:58.2105163Z microcode : 0x5003901 2025-05-07T19:42:58.2105402Z cpu MHz : 3196.502 2025-05-07T19:42:58.2105615Z cache size : 36608 KB 2025-05-07T19:42:58.2105922Z physical id : 0 2025-05-07T19:42:58.2106132Z siblings : 48 2025-05-07T19:42:58.2106343Z core id : 11 2025-05-07T19:42:58.2106540Z cpu cores : 24 2025-05-07T19:42:58.2106760Z apicid : 22 2025-05-07T19:42:58.2106965Z initial apicid : 22 2025-05-07T19:42:58.2107195Z fpu : yes 2025-05-07T19:42:58.2107397Z fpu_exception : yes 2025-05-07T19:42:58.2107630Z cpuid level : 13 2025-05-07T19:42:58.2107850Z wp : yes 2025-05-07T19:42:58.2110115Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2112750Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2113344Z bogomips : 5999.98 2025-05-07T19:42:58.2113564Z clflush size : 64 2025-05-07T19:42:58.2113874Z cache_alignment : 64 2025-05-07T19:42:58.2114140Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2114470Z power management: 2025-05-07T19:42:58.2114603Z 2025-05-07T19:42:58.2114774Z processor : 12 2025-05-07T19:42:58.2115006Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2115249Z cpu family : 6 2025-05-07T19:42:58.2115468Z model : 85 2025-05-07T19:42:58.2115753Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2116105Z stepping : 7 2025-05-07T19:42:58.2116320Z microcode : 0x5003901 2025-05-07T19:42:58.2116542Z cpu MHz : 2999.994 2025-05-07T19:42:58.2116770Z cache size : 36608 KB 2025-05-07T19:42:58.2116991Z physical id : 0 2025-05-07T19:42:58.2117210Z siblings : 48 2025-05-07T19:42:58.2117411Z core id : 12 2025-05-07T19:42:58.2117624Z cpu cores : 24 2025-05-07T19:42:58.2117824Z apicid : 24 2025-05-07T19:42:58.2118040Z initial apicid : 24 2025-05-07T19:42:58.2118260Z fpu : yes 2025-05-07T19:42:58.2118470Z fpu_exception : yes 2025-05-07T19:42:58.2118681Z cpuid level : 13 2025-05-07T19:42:58.2118896Z wp : yes 2025-05-07T19:42:58.2121177Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2123817Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2124396Z bogomips : 5999.98 2025-05-07T19:42:58.2124620Z clflush size : 64 2025-05-07T19:42:58.2124924Z cache_alignment : 64 2025-05-07T19:42:58.2125196Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2125514Z power management: 2025-05-07T19:42:58.2125662Z 2025-05-07T19:42:58.2125745Z processor : 13 2025-05-07T19:42:58.2125958Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2126202Z cpu family : 6 2025-05-07T19:42:58.2126401Z model : 85 2025-05-07T19:42:58.2126690Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2127056Z stepping : 7 2025-05-07T19:42:58.2127260Z microcode : 0x5003901 2025-05-07T19:42:58.2127502Z cpu MHz : 2999.994 2025-05-07T19:42:58.2127714Z cache size : 36608 KB 2025-05-07T19:42:58.2127950Z physical id : 0 2025-05-07T19:42:58.2128147Z siblings : 48 2025-05-07T19:42:58.2128631Z core id : 13 2025-05-07T19:42:58.2128838Z cpu cores : 24 2025-05-07T19:42:58.2129168Z apicid : 26 2025-05-07T19:42:58.2129368Z initial apicid : 26 2025-05-07T19:42:58.2129673Z fpu : yes 2025-05-07T19:42:58.2129879Z fpu_exception : yes 2025-05-07T19:42:58.2130107Z cpuid level : 13 2025-05-07T19:42:58.2130322Z wp : yes 2025-05-07T19:42:58.2132580Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2135213Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2135802Z bogomips : 5999.98 2025-05-07T19:42:58.2136015Z clflush size : 64 2025-05-07T19:42:58.2136245Z cache_alignment : 64 2025-05-07T19:42:58.2136513Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2136845Z power management: 2025-05-07T19:42:58.2136974Z 2025-05-07T19:42:58.2137063Z processor : 14 2025-05-07T19:42:58.2137288Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2137538Z cpu family : 6 2025-05-07T19:42:58.2137735Z model : 85 2025-05-07T19:42:58.2138015Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2138362Z stepping : 7 2025-05-07T19:42:58.2138578Z microcode : 0x5003901 2025-05-07T19:42:58.2138791Z cpu MHz : 3190.887 2025-05-07T19:42:58.2139016Z cache size : 36608 KB 2025-05-07T19:42:58.2139237Z physical id : 0 2025-05-07T19:42:58.2139458Z siblings : 48 2025-05-07T19:42:58.2139654Z core id : 14 2025-05-07T19:42:58.2139861Z cpu cores : 24 2025-05-07T19:42:58.2140066Z apicid : 28 2025-05-07T19:42:58.2140274Z initial apicid : 28 2025-05-07T19:42:58.2140493Z fpu : yes 2025-05-07T19:42:58.2140688Z fpu_exception : yes 2025-05-07T19:42:58.2140910Z cpuid level : 13 2025-05-07T19:42:58.2141118Z wp : yes 2025-05-07T19:42:58.2143396Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2146034Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2146612Z bogomips : 5999.98 2025-05-07T19:42:58.2146845Z clflush size : 64 2025-05-07T19:42:58.2147056Z cache_alignment : 64 2025-05-07T19:42:58.2147334Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2147745Z power management: 2025-05-07T19:42:58.2148048Z 2025-05-07T19:42:58.2148133Z processor : 15 2025-05-07T19:42:58.2148365Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2148599Z cpu family : 6 2025-05-07T19:42:58.2148815Z model : 85 2025-05-07T19:42:58.2149085Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2149450Z stepping : 7 2025-05-07T19:42:58.2149655Z microcode : 0x5003901 2025-05-07T19:42:58.2149892Z cpu MHz : 2999.994 2025-05-07T19:42:58.2150104Z cache size : 36608 KB 2025-05-07T19:42:58.2150336Z physical id : 0 2025-05-07T19:42:58.2150539Z siblings : 48 2025-05-07T19:42:58.2150747Z core id : 15 2025-05-07T19:42:58.2150945Z cpu cores : 24 2025-05-07T19:42:58.2151158Z apicid : 30 2025-05-07T19:42:58.2151432Z initial apicid : 30 2025-05-07T19:42:58.2151641Z fpu : yes 2025-05-07T19:42:58.2151849Z fpu_exception : yes 2025-05-07T19:42:58.2152060Z cpuid level : 13 2025-05-07T19:42:58.2152273Z wp : yes 2025-05-07T19:42:58.2154646Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2157279Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2157876Z bogomips : 5999.98 2025-05-07T19:42:58.2158086Z clflush size : 64 2025-05-07T19:42:58.2158312Z cache_alignment : 64 2025-05-07T19:42:58.2158575Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2158910Z power management: 2025-05-07T19:42:58.2159042Z 2025-05-07T19:42:58.2159123Z processor : 16 2025-05-07T19:42:58.2159345Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2159596Z cpu family : 6 2025-05-07T19:42:58.2159800Z model : 85 2025-05-07T19:42:58.2160084Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2160429Z stepping : 7 2025-05-07T19:42:58.2160645Z microcode : 0x5003901 2025-05-07T19:42:58.2160863Z cpu MHz : 3157.596 2025-05-07T19:42:58.2161086Z cache size : 36608 KB 2025-05-07T19:42:58.2161305Z physical id : 0 2025-05-07T19:42:58.2161523Z siblings : 48 2025-05-07T19:42:58.2161719Z core id : 16 2025-05-07T19:42:58.2161931Z cpu cores : 24 2025-05-07T19:42:58.2162134Z apicid : 32 2025-05-07T19:42:58.2162346Z initial apicid : 32 2025-05-07T19:42:58.2162564Z fpu : yes 2025-05-07T19:42:58.2162759Z fpu_exception : yes 2025-05-07T19:42:58.2162982Z cpuid level : 13 2025-05-07T19:42:58.2163185Z wp : yes 2025-05-07T19:42:58.2165468Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2168118Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2168736Z bogomips : 5999.98 2025-05-07T19:42:58.2168975Z clflush size : 64 2025-05-07T19:42:58.2169193Z cache_alignment : 64 2025-05-07T19:42:58.2169479Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2169812Z power management: 2025-05-07T19:42:58.2170023Z 2025-05-07T19:42:58.2170114Z processor : 17 2025-05-07T19:42:58.2170347Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2170651Z cpu family : 6 2025-05-07T19:42:58.2170873Z model : 85 2025-05-07T19:42:58.2171148Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2171514Z stepping : 7 2025-05-07T19:42:58.2171722Z microcode : 0x5003901 2025-05-07T19:42:58.2171959Z cpu MHz : 3718.193 2025-05-07T19:42:58.2172175Z cache size : 36608 KB 2025-05-07T19:42:58.2172412Z physical id : 0 2025-05-07T19:42:58.2172618Z siblings : 48 2025-05-07T19:42:58.2172837Z core id : 17 2025-05-07T19:42:58.2173036Z cpu cores : 24 2025-05-07T19:42:58.2173250Z apicid : 34 2025-05-07T19:42:58.2173467Z initial apicid : 34 2025-05-07T19:42:58.2173684Z fpu : yes 2025-05-07T19:42:58.2173959Z fpu_exception : yes 2025-05-07T19:42:58.2174172Z cpuid level : 13 2025-05-07T19:42:58.2174385Z wp : yes 2025-05-07T19:42:58.2176644Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2179276Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2179877Z bogomips : 5999.98 2025-05-07T19:42:58.2180095Z clflush size : 64 2025-05-07T19:42:58.2180325Z cache_alignment : 64 2025-05-07T19:42:58.2180595Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2180928Z power management: 2025-05-07T19:42:58.2181058Z 2025-05-07T19:42:58.2181158Z processor : 18 2025-05-07T19:42:58.2181373Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2181620Z cpu family : 6 2025-05-07T19:42:58.2181818Z model : 85 2025-05-07T19:42:58.2182107Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2182457Z stepping : 7 2025-05-07T19:42:58.2182675Z microcode : 0x5003901 2025-05-07T19:42:58.2182894Z cpu MHz : 3221.273 2025-05-07T19:42:58.2183273Z cache size : 36608 KB 2025-05-07T19:42:58.2183484Z physical id : 0 2025-05-07T19:42:58.2183697Z siblings : 48 2025-05-07T19:42:58.2183887Z core id : 18 2025-05-07T19:42:58.2184088Z cpu cores : 24 2025-05-07T19:42:58.2184283Z apicid : 36 2025-05-07T19:42:58.2184489Z initial apicid : 36 2025-05-07T19:42:58.2184706Z fpu : yes 2025-05-07T19:42:58.2184894Z fpu_exception : yes 2025-05-07T19:42:58.2185122Z cpuid level : 13 2025-05-07T19:42:58.2185319Z wp : yes 2025-05-07T19:42:58.2187733Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2190390Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2190964Z bogomips : 5999.98 2025-05-07T19:42:58.2191191Z clflush size : 64 2025-05-07T19:42:58.2191402Z cache_alignment : 64 2025-05-07T19:42:58.2191688Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2192011Z power management: 2025-05-07T19:42:58.2192156Z 2025-05-07T19:42:58.2192239Z processor : 19 2025-05-07T19:42:58.2192463Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2192762Z cpu family : 6 2025-05-07T19:42:58.2192976Z model : 85 2025-05-07T19:42:58.2193245Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2193608Z stepping : 7 2025-05-07T19:42:58.2193883Z microcode : 0x5003901 2025-05-07T19:42:58.2194116Z cpu MHz : 3435.680 2025-05-07T19:42:58.2194331Z cache size : 36608 KB 2025-05-07T19:42:58.2194564Z physical id : 0 2025-05-07T19:42:58.2194818Z siblings : 48 2025-05-07T19:42:58.2195028Z core id : 19 2025-05-07T19:42:58.2195219Z cpu cores : 24 2025-05-07T19:42:58.2195426Z apicid : 38 2025-05-07T19:42:58.2195627Z initial apicid : 38 2025-05-07T19:42:58.2195833Z fpu : yes 2025-05-07T19:42:58.2196039Z fpu_exception : yes 2025-05-07T19:42:58.2196265Z cpuid level : 13 2025-05-07T19:42:58.2196491Z wp : yes 2025-05-07T19:42:58.2198832Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2201468Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2202075Z bogomips : 5999.98 2025-05-07T19:42:58.2202291Z clflush size : 64 2025-05-07T19:42:58.2202518Z cache_alignment : 64 2025-05-07T19:42:58.2202819Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2203161Z power management: 2025-05-07T19:42:58.2203317Z 2025-05-07T19:42:58.2203409Z processor : 20 2025-05-07T19:42:58.2203621Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2203872Z cpu family : 6 2025-05-07T19:42:58.2204086Z model : 85 2025-05-07T19:42:58.2204380Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2204736Z stepping : 7 2025-05-07T19:42:58.2204967Z microcode : 0x5003901 2025-05-07T19:42:58.2205210Z cpu MHz : 2999.994 2025-05-07T19:42:58.2205427Z cache size : 36608 KB 2025-05-07T19:42:58.2205675Z physical id : 0 2025-05-07T19:42:58.2205890Z siblings : 48 2025-05-07T19:42:58.2206205Z core id : 20 2025-05-07T19:42:58.2206395Z cpu cores : 24 2025-05-07T19:42:58.2206604Z apicid : 40 2025-05-07T19:42:58.2206795Z initial apicid : 40 2025-05-07T19:42:58.2207011Z fpu : yes 2025-05-07T19:42:58.2207204Z fpu_exception : yes 2025-05-07T19:42:58.2207427Z cpuid level : 13 2025-05-07T19:42:58.2207619Z wp : yes 2025-05-07T19:42:58.2209736Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2212176Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2212732Z bogomips : 5999.98 2025-05-07T19:42:58.2212935Z clflush size : 64 2025-05-07T19:42:58.2213155Z cache_alignment : 64 2025-05-07T19:42:58.2213408Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2213731Z power management: 2025-05-07T19:42:58.2213859Z 2025-05-07T19:42:58.2213939Z processor : 21 2025-05-07T19:42:58.2214160Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2214382Z cpu family : 6 2025-05-07T19:42:58.2214586Z model : 85 2025-05-07T19:42:58.2214918Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2215266Z stepping : 7 2025-05-07T19:42:58.2215482Z microcode : 0x5003901 2025-05-07T19:42:58.2215694Z cpu MHz : 2999.994 2025-05-07T19:42:58.2215918Z cache size : 36608 KB 2025-05-07T19:42:58.2216135Z physical id : 0 2025-05-07T19:42:58.2216351Z siblings : 48 2025-05-07T19:42:58.2216544Z core id : 21 2025-05-07T19:42:58.2216753Z cpu cores : 24 2025-05-07T19:42:58.2216942Z apicid : 42 2025-05-07T19:42:58.2217126Z initial apicid : 42 2025-05-07T19:42:58.2217310Z fpu : yes 2025-05-07T19:42:58.2217497Z fpu_exception : yes 2025-05-07T19:42:58.2217683Z cpuid level : 13 2025-05-07T19:42:58.2217875Z wp : yes 2025-05-07T19:42:58.2220013Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2222427Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2222970Z bogomips : 5999.98 2025-05-07T19:42:58.2223167Z clflush size : 64 2025-05-07T19:42:58.2223359Z cache_alignment : 64 2025-05-07T19:42:58.2223612Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2223898Z power management: 2025-05-07T19:42:58.2224015Z 2025-05-07T19:42:58.2224105Z processor : 22 2025-05-07T19:42:58.2224298Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2224519Z cpu family : 6 2025-05-07T19:42:58.2224695Z model : 85 2025-05-07T19:42:58.2224945Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2225258Z stepping : 7 2025-05-07T19:42:58.2225450Z microcode : 0x5003901 2025-05-07T19:42:58.2225659Z cpu MHz : 2999.994 2025-05-07T19:42:58.2225847Z cache size : 36608 KB 2025-05-07T19:42:58.2226052Z physical id : 0 2025-05-07T19:42:58.2226235Z siblings : 48 2025-05-07T19:42:58.2226422Z core id : 22 2025-05-07T19:42:58.2226597Z cpu cores : 24 2025-05-07T19:42:58.2226784Z apicid : 44 2025-05-07T19:42:58.2226961Z initial apicid : 44 2025-05-07T19:42:58.2227155Z fpu : yes 2025-05-07T19:42:58.2227323Z fpu_exception : yes 2025-05-07T19:42:58.2227521Z cpuid level : 13 2025-05-07T19:42:58.2227693Z wp : yes 2025-05-07T19:42:58.2230200Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2232827Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2233393Z bogomips : 5999.98 2025-05-07T19:42:58.2233604Z clflush size : 64 2025-05-07T19:42:58.2233874Z cache_alignment : 64 2025-05-07T19:42:58.2234126Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2234442Z power management: 2025-05-07T19:42:58.2234569Z 2025-05-07T19:42:58.2234651Z processor : 23 2025-05-07T19:42:58.2234867Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2235088Z cpu family : 6 2025-05-07T19:42:58.2235291Z model : 85 2025-05-07T19:42:58.2235546Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2235997Z stepping : 7 2025-05-07T19:42:58.2236191Z microcode : 0x5003901 2025-05-07T19:42:58.2236408Z cpu MHz : 2999.994 2025-05-07T19:42:58.2236620Z cache size : 36608 KB 2025-05-07T19:42:58.2236830Z physical id : 0 2025-05-07T19:42:58.2237032Z siblings : 48 2025-05-07T19:42:58.2237217Z core id : 23 2025-05-07T19:42:58.2237417Z cpu cores : 24 2025-05-07T19:42:58.2237609Z apicid : 46 2025-05-07T19:42:58.2237810Z initial apicid : 46 2025-05-07T19:42:58.2238005Z fpu : yes 2025-05-07T19:42:58.2238203Z fpu_exception : yes 2025-05-07T19:42:58.2238405Z cpuid level : 13 2025-05-07T19:42:58.2238681Z wp : yes 2025-05-07T19:42:58.2241010Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2243626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2244201Z bogomips : 5999.98 2025-05-07T19:42:58.2244413Z clflush size : 64 2025-05-07T19:42:58.2244614Z cache_alignment : 64 2025-05-07T19:42:58.2244881Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2245197Z power management: 2025-05-07T19:42:58.2245323Z 2025-05-07T19:42:58.2245414Z processor : 24 2025-05-07T19:42:58.2245722Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2246060Z cpu family : 6 2025-05-07T19:42:58.2246234Z model : 85 2025-05-07T19:42:58.2246480Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2246794Z stepping : 7 2025-05-07T19:42:58.2246981Z microcode : 0x5003901 2025-05-07T19:42:58.2247190Z cpu MHz : 2999.994 2025-05-07T19:42:58.2247377Z cache size : 36608 KB 2025-05-07T19:42:58.2247583Z physical id : 1 2025-05-07T19:42:58.2247759Z siblings : 48 2025-05-07T19:42:58.2247941Z core id : 0 2025-05-07T19:42:58.2248113Z cpu cores : 24 2025-05-07T19:42:58.2248298Z apicid : 64 2025-05-07T19:42:58.2248470Z initial apicid : 64 2025-05-07T19:42:58.2248675Z fpu : yes 2025-05-07T19:42:58.2248842Z fpu_exception : yes 2025-05-07T19:42:58.2249039Z cpuid level : 13 2025-05-07T19:42:58.2249225Z wp : yes 2025-05-07T19:42:58.2251314Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2253727Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2254262Z bogomips : 5999.98 2025-05-07T19:42:58.2254452Z clflush size : 64 2025-05-07T19:42:58.2254645Z cache_alignment : 64 2025-05-07T19:42:58.2254881Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2255179Z power management: 2025-05-07T19:42:58.2255294Z 2025-05-07T19:42:58.2255366Z processor : 25 2025-05-07T19:42:58.2255565Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2255770Z cpu family : 6 2025-05-07T19:42:58.2255951Z model : 85 2025-05-07T19:42:58.2256195Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2256514Z stepping : 7 2025-05-07T19:42:58.2256695Z microcode : 0x5003901 2025-05-07T19:42:58.2256902Z cpu MHz : 3132.000 2025-05-07T19:42:58.2257146Z cache size : 36608 KB 2025-05-07T19:42:58.2257338Z physical id : 1 2025-05-07T19:42:58.2257529Z siblings : 48 2025-05-07T19:42:58.2257703Z core id : 1 2025-05-07T19:42:58.2257885Z cpu cores : 24 2025-05-07T19:42:58.2258062Z apicid : 66 2025-05-07T19:42:58.2258251Z initial apicid : 66 2025-05-07T19:42:58.2258432Z fpu : yes 2025-05-07T19:42:58.2258614Z fpu_exception : yes 2025-05-07T19:42:58.2258803Z cpuid level : 13 2025-05-07T19:42:58.2258993Z wp : yes 2025-05-07T19:42:58.2261128Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2263541Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2264075Z bogomips : 5999.98 2025-05-07T19:42:58.2264270Z clflush size : 64 2025-05-07T19:42:58.2264461Z cache_alignment : 64 2025-05-07T19:42:58.2264707Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2264994Z power management: 2025-05-07T19:42:58.2265113Z 2025-05-07T19:42:58.2265210Z processor : 26 2025-05-07T19:42:58.2265409Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2265645Z cpu family : 6 2025-05-07T19:42:58.2265834Z model : 85 2025-05-07T19:42:58.2266094Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2266421Z stepping : 7 2025-05-07T19:42:58.2266628Z microcode : 0x5003901 2025-05-07T19:42:58.2266850Z cpu MHz : 1271.034 2025-05-07T19:42:58.2267045Z cache size : 36608 KB 2025-05-07T19:42:58.2267269Z physical id : 1 2025-05-07T19:42:58.2267456Z siblings : 48 2025-05-07T19:42:58.2267651Z core id : 2 2025-05-07T19:42:58.2267829Z cpu cores : 24 2025-05-07T19:42:58.2268024Z apicid : 68 2025-05-07T19:42:58.2268204Z initial apicid : 68 2025-05-07T19:42:58.2268404Z fpu : yes 2025-05-07T19:42:58.2268585Z fpu_exception : yes 2025-05-07T19:42:58.2268789Z cpuid level : 13 2025-05-07T19:42:58.2268973Z wp : yes 2025-05-07T19:42:58.2271072Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2273505Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2274284Z bogomips : 5999.98 2025-05-07T19:42:58.2274498Z clflush size : 64 2025-05-07T19:42:58.2274725Z cache_alignment : 64 2025-05-07T19:42:58.2275063Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2275394Z power management: 2025-05-07T19:42:58.2275522Z 2025-05-07T19:42:58.2275605Z processor : 27 2025-05-07T19:42:58.2275826Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2276060Z cpu family : 6 2025-05-07T19:42:58.2276268Z model : 85 2025-05-07T19:42:58.2276537Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2276901Z stepping : 7 2025-05-07T19:42:58.2277100Z microcode : 0x5003901 2025-05-07T19:42:58.2277331Z cpu MHz : 2999.994 2025-05-07T19:42:58.2277554Z cache size : 36608 KB 2025-05-07T19:42:58.2277770Z physical id : 1 2025-05-07T19:42:58.2278050Z siblings : 48 2025-05-07T19:42:58.2278248Z core id : 3 2025-05-07T19:42:58.2278460Z cpu cores : 24 2025-05-07T19:42:58.2278661Z apicid : 70 2025-05-07T19:42:58.2278877Z initial apicid : 70 2025-05-07T19:42:58.2279091Z fpu : yes 2025-05-07T19:42:58.2279296Z fpu_exception : yes 2025-05-07T19:42:58.2279506Z cpuid level : 13 2025-05-07T19:42:58.2279721Z wp : yes 2025-05-07T19:42:58.2282060Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2284847Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2285444Z bogomips : 5999.98 2025-05-07T19:42:58.2285669Z clflush size : 64 2025-05-07T19:42:58.2285878Z cache_alignment : 64 2025-05-07T19:42:58.2286158Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2286571Z power management: 2025-05-07T19:42:58.2286696Z 2025-05-07T19:42:58.2286788Z processor : 28 2025-05-07T19:42:58.2286986Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2287213Z cpu family : 6 2025-05-07T19:42:58.2287396Z model : 85 2025-05-07T19:42:58.2287660Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2287975Z stepping : 7 2025-05-07T19:42:58.2288178Z microcode : 0x5003901 2025-05-07T19:42:58.2288396Z cpu MHz : 2999.994 2025-05-07T19:42:58.2288590Z cache size : 36608 KB 2025-05-07T19:42:58.2288802Z physical id : 1 2025-05-07T19:42:58.2288992Z siblings : 48 2025-05-07T19:42:58.2289187Z core id : 4 2025-05-07T19:42:58.2289365Z cpu cores : 24 2025-05-07T19:42:58.2289561Z apicid : 72 2025-05-07T19:42:58.2289742Z initial apicid : 72 2025-05-07T19:42:58.2289946Z fpu : yes 2025-05-07T19:42:58.2290124Z fpu_exception : yes 2025-05-07T19:42:58.2290336Z cpuid level : 13 2025-05-07T19:42:58.2290522Z wp : yes 2025-05-07T19:42:58.2292620Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2295054Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2295603Z bogomips : 5999.98 2025-05-07T19:42:58.2295799Z clflush size : 64 2025-05-07T19:42:58.2296016Z cache_alignment : 64 2025-05-07T19:42:58.2296261Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2296567Z power management: 2025-05-07T19:42:58.2296689Z 2025-05-07T19:42:58.2296766Z processor : 29 2025-05-07T19:42:58.2296974Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2297192Z cpu family : 6 2025-05-07T19:42:58.2297387Z model : 85 2025-05-07T19:42:58.2297634Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2297983Z stepping : 7 2025-05-07T19:42:58.2298182Z microcode : 0x5003901 2025-05-07T19:42:58.2298420Z cpu MHz : 1330.491 2025-05-07T19:42:58.2298641Z cache size : 36608 KB 2025-05-07T19:42:58.2298859Z physical id : 1 2025-05-07T19:42:58.2299081Z siblings : 48 2025-05-07T19:42:58.2299274Z core id : 5 2025-05-07T19:42:58.2299552Z cpu cores : 24 2025-05-07T19:42:58.2299748Z apicid : 74 2025-05-07T19:42:58.2299964Z initial apicid : 74 2025-05-07T19:42:58.2300168Z fpu : yes 2025-05-07T19:42:58.2300387Z fpu_exception : yes 2025-05-07T19:42:58.2300595Z cpuid level : 13 2025-05-07T19:42:58.2300819Z wp : yes 2025-05-07T19:42:58.2303024Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2305462Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2306031Z bogomips : 5999.98 2025-05-07T19:42:58.2306263Z clflush size : 64 2025-05-07T19:42:58.2306471Z cache_alignment : 64 2025-05-07T19:42:58.2306745Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2307057Z power management: 2025-05-07T19:42:58.2307185Z 2025-05-07T19:42:58.2307293Z processor : 30 2025-05-07T19:42:58.2307502Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2307757Z cpu family : 6 2025-05-07T19:42:58.2307949Z model : 85 2025-05-07T19:42:58.2308226Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2308558Z stepping : 7 2025-05-07T19:42:58.2308773Z microcode : 0x5003901 2025-05-07T19:42:58.2308998Z cpu MHz : 2999.994 2025-05-07T19:42:58.2309208Z cache size : 36608 KB 2025-05-07T19:42:58.2309433Z physical id : 1 2025-05-07T19:42:58.2309629Z siblings : 48 2025-05-07T19:42:58.2309838Z core id : 6 2025-05-07T19:42:58.2310024Z cpu cores : 24 2025-05-07T19:42:58.2310236Z apicid : 76 2025-05-07T19:42:58.2310432Z initial apicid : 76 2025-05-07T19:42:58.2310634Z fpu : yes 2025-05-07T19:42:58.2310807Z fpu_exception : yes 2025-05-07T19:42:58.2311002Z cpuid level : 13 2025-05-07T19:42:58.2311176Z wp : yes 2025-05-07T19:42:58.2313263Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2316052Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2316640Z bogomips : 5999.98 2025-05-07T19:42:58.2316842Z clflush size : 64 2025-05-07T19:42:58.2317056Z cache_alignment : 64 2025-05-07T19:42:58.2317317Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2317636Z power management: 2025-05-07T19:42:58.2317759Z 2025-05-07T19:42:58.2317839Z processor : 31 2025-05-07T19:42:58.2318054Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2318282Z cpu family : 6 2025-05-07T19:42:58.2318479Z model : 85 2025-05-07T19:42:58.2318744Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2319088Z stepping : 7 2025-05-07T19:42:58.2319282Z microcode : 0x5003901 2025-05-07T19:42:58.2319507Z cpu MHz : 1510.793 2025-05-07T19:42:58.2319714Z cache size : 36608 KB 2025-05-07T19:42:58.2319922Z physical id : 1 2025-05-07T19:42:58.2320133Z siblings : 48 2025-05-07T19:42:58.2320320Z core id : 7 2025-05-07T19:42:58.2320513Z cpu cores : 24 2025-05-07T19:42:58.2320702Z apicid : 78 2025-05-07T19:42:58.2320898Z initial apicid : 78 2025-05-07T19:42:58.2322985Z fpu : yes 2025-05-07T19:42:58.2323184Z fpu_exception : yes 2025-05-07T19:42:58.2323389Z cpuid level : 13 2025-05-07T19:42:58.2323593Z wp : yes 2025-05-07T19:42:58.2325918Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2328614Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2329452Z bogomips : 5999.98 2025-05-07T19:42:58.2329673Z clflush size : 64 2025-05-07T19:42:58.2329894Z cache_alignment : 64 2025-05-07T19:42:58.2330163Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2330480Z power management: 2025-05-07T19:42:58.2330609Z 2025-05-07T19:42:58.2330702Z processor : 32 2025-05-07T19:42:58.2330904Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2331139Z cpu family : 6 2025-05-07T19:42:58.2331331Z model : 85 2025-05-07T19:42:58.2331598Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2331935Z stepping : 7 2025-05-07T19:42:58.2332140Z microcode : 0x5003901 2025-05-07T19:42:58.2332363Z cpu MHz : 1322.633 2025-05-07T19:42:58.2332560Z cache size : 36608 KB 2025-05-07T19:42:58.2332785Z physical id : 1 2025-05-07T19:42:58.2332975Z siblings : 48 2025-05-07T19:42:58.2333182Z core id : 8 2025-05-07T19:42:58.2333367Z cpu cores : 24 2025-05-07T19:42:58.2333568Z apicid : 80 2025-05-07T19:42:58.2333759Z initial apicid : 80 2025-05-07T19:42:58.2333975Z fpu : yes 2025-05-07T19:42:58.2334167Z fpu_exception : yes 2025-05-07T19:42:58.2334382Z cpuid level : 13 2025-05-07T19:42:58.2334577Z wp : yes 2025-05-07T19:42:58.2336840Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2339471Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2340061Z bogomips : 5999.98 2025-05-07T19:42:58.2340283Z clflush size : 64 2025-05-07T19:42:58.2340524Z cache_alignment : 64 2025-05-07T19:42:58.2340796Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2341144Z power management: 2025-05-07T19:42:58.2341279Z 2025-05-07T19:42:58.2341372Z processor : 33 2025-05-07T19:42:58.2341705Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2342040Z cpu family : 6 2025-05-07T19:42:58.2342233Z model : 85 2025-05-07T19:42:58.2342484Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2342816Z stepping : 7 2025-05-07T19:42:58.2343014Z microcode : 0x5003901 2025-05-07T19:42:58.2343215Z cpu MHz : 1481.719 2025-05-07T19:42:58.2343429Z cache size : 36608 KB 2025-05-07T19:42:58.2343629Z physical id : 1 2025-05-07T19:42:58.2343844Z siblings : 48 2025-05-07T19:42:58.2344024Z core id : 9 2025-05-07T19:42:58.2344215Z cpu cores : 24 2025-05-07T19:42:58.2344404Z apicid : 82 2025-05-07T19:42:58.2344608Z initial apicid : 82 2025-05-07T19:42:58.2344806Z fpu : yes 2025-05-07T19:42:58.2345003Z fpu_exception : yes 2025-05-07T19:42:58.2345293Z cpuid level : 13 2025-05-07T19:42:58.2345494Z wp : yes 2025-05-07T19:42:58.2347604Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2350092Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2350654Z bogomips : 5999.98 2025-05-07T19:42:58.2350868Z clflush size : 64 2025-05-07T19:42:58.2351074Z cache_alignment : 64 2025-05-07T19:42:58.2351349Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2351645Z power management: 2025-05-07T19:42:58.2351762Z 2025-05-07T19:42:58.2351854Z processor : 34 2025-05-07T19:42:58.2352045Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2352268Z cpu family : 6 2025-05-07T19:42:58.2352442Z model : 85 2025-05-07T19:42:58.2352687Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2352997Z stepping : 7 2025-05-07T19:42:58.2353181Z microcode : 0x5003901 2025-05-07T19:42:58.2353381Z cpu MHz : 2245.753 2025-05-07T19:42:58.2353565Z cache size : 36608 KB 2025-05-07T19:42:58.2353865Z physical id : 1 2025-05-07T19:42:58.2354212Z siblings : 48 2025-05-07T19:42:58.2354410Z core id : 10 2025-05-07T19:42:58.2354593Z cpu cores : 24 2025-05-07T19:42:58.2354831Z apicid : 84 2025-05-07T19:42:58.2355016Z initial apicid : 84 2025-05-07T19:42:58.2355218Z fpu : yes 2025-05-07T19:42:58.2355401Z fpu_exception : yes 2025-05-07T19:42:58.2355610Z cpuid level : 13 2025-05-07T19:42:58.2355686Z wp : yes 2025-05-07T19:42:58.2357833Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2358217Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2358307Z bogomips : 5999.98 2025-05-07T19:42:58.2358394Z clflush size : 64 2025-05-07T19:42:58.2358474Z cache_alignment : 64 2025-05-07T19:42:58.2358596Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2358679Z power management: 2025-05-07T19:42:58.2358691Z 2025-05-07T19:42:58.2358767Z processor : 35 2025-05-07T19:42:58.2358857Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2358934Z cpu family : 6 2025-05-07T19:42:58.2359016Z model : 85 2025-05-07T19:42:58.2359176Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2359257Z stepping : 7 2025-05-07T19:42:58.2359335Z microcode : 0x5003901 2025-05-07T19:42:58.2359422Z cpu MHz : 2999.994 2025-05-07T19:42:58.2359508Z cache size : 36608 KB 2025-05-07T19:42:58.2359589Z physical id : 1 2025-05-07T19:42:58.2359671Z siblings : 48 2025-05-07T19:42:58.2359742Z core id : 11 2025-05-07T19:42:58.2359815Z cpu cores : 24 2025-05-07T19:42:58.2359890Z apicid : 86 2025-05-07T19:42:58.2359989Z initial apicid : 86 2025-05-07T19:42:58.2360062Z fpu : yes 2025-05-07T19:42:58.2360144Z fpu_exception : yes 2025-05-07T19:42:58.2360236Z cpuid level : 13 2025-05-07T19:42:58.2360308Z wp : yes 2025-05-07T19:42:58.2362510Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2362906Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2362988Z bogomips : 5999.98 2025-05-07T19:42:58.2363133Z clflush size : 64 2025-05-07T19:42:58.2363225Z cache_alignment : 64 2025-05-07T19:42:58.2363350Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2363433Z power management: 2025-05-07T19:42:58.2363441Z 2025-05-07T19:42:58.2363524Z processor : 36 2025-05-07T19:42:58.2363618Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2363695Z cpu family : 6 2025-05-07T19:42:58.2363770Z model : 85 2025-05-07T19:42:58.2363933Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2364011Z stepping : 7 2025-05-07T19:42:58.2364094Z microcode : 0x5003901 2025-05-07T19:42:58.2364170Z cpu MHz : 1332.056 2025-05-07T19:42:58.2364259Z cache size : 36608 KB 2025-05-07T19:42:58.2364339Z physical id : 1 2025-05-07T19:42:58.2364419Z siblings : 48 2025-05-07T19:42:58.2364499Z core id : 12 2025-05-07T19:42:58.2364578Z cpu cores : 24 2025-05-07T19:42:58.2364653Z apicid : 88 2025-05-07T19:42:58.2364731Z initial apicid : 88 2025-05-07T19:42:58.2364818Z fpu : yes 2025-05-07T19:42:58.2364902Z fpu_exception : yes 2025-05-07T19:42:58.2364981Z cpuid level : 13 2025-05-07T19:42:58.2365069Z wp : yes 2025-05-07T19:42:58.2367249Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2367603Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2367687Z bogomips : 5999.98 2025-05-07T19:42:58.2367763Z clflush size : 64 2025-05-07T19:42:58.2367839Z cache_alignment : 64 2025-05-07T19:42:58.2367965Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2368044Z power management: 2025-05-07T19:42:58.2368048Z 2025-05-07T19:42:58.2368126Z processor : 37 2025-05-07T19:42:58.2368208Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2368289Z cpu family : 6 2025-05-07T19:42:58.2368358Z model : 85 2025-05-07T19:42:58.2368497Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2368583Z stepping : 7 2025-05-07T19:42:58.2368655Z microcode : 0x5003901 2025-05-07T19:42:58.2368728Z cpu MHz : 2999.994 2025-05-07T19:42:58.2368802Z cache size : 36608 KB 2025-05-07T19:42:58.2368886Z physical id : 1 2025-05-07T19:42:58.2368956Z siblings : 48 2025-05-07T19:42:58.2369028Z core id : 13 2025-05-07T19:42:58.2369110Z cpu cores : 24 2025-05-07T19:42:58.2369181Z apicid : 90 2025-05-07T19:42:58.2369258Z initial apicid : 90 2025-05-07T19:42:58.2369332Z fpu : yes 2025-05-07T19:42:58.2369418Z fpu_exception : yes 2025-05-07T19:42:58.2369497Z cpuid level : 13 2025-05-07T19:42:58.2369576Z wp : yes 2025-05-07T19:42:58.2371567Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2371975Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2372052Z bogomips : 5999.98 2025-05-07T19:42:58.2372136Z clflush size : 64 2025-05-07T19:42:58.2372214Z cache_alignment : 64 2025-05-07T19:42:58.2372381Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2372466Z power management: 2025-05-07T19:42:58.2372470Z 2025-05-07T19:42:58.2372543Z processor : 38 2025-05-07T19:42:58.2372627Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2372700Z cpu family : 6 2025-05-07T19:42:58.2372782Z model : 85 2025-05-07T19:42:58.2372925Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2372996Z stepping : 7 2025-05-07T19:42:58.2373083Z microcode : 0x5003901 2025-05-07T19:42:58.2373157Z cpu MHz : 2999.994 2025-05-07T19:42:58.2373232Z cache size : 36608 KB 2025-05-07T19:42:58.2373308Z physical id : 1 2025-05-07T19:42:58.2373394Z siblings : 48 2025-05-07T19:42:58.2373463Z core id : 14 2025-05-07T19:42:58.2373534Z cpu cores : 24 2025-05-07T19:42:58.2373615Z apicid : 92 2025-05-07T19:42:58.2373689Z initial apicid : 92 2025-05-07T19:42:58.2373757Z fpu : yes 2025-05-07T19:42:58.2373836Z fpu_exception : yes 2025-05-07T19:42:58.2373918Z cpuid level : 13 2025-05-07T19:42:58.2373991Z wp : yes 2025-05-07T19:42:58.2375971Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2376336Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2376413Z bogomips : 5999.98 2025-05-07T19:42:58.2376493Z clflush size : 64 2025-05-07T19:42:58.2376576Z cache_alignment : 64 2025-05-07T19:42:58.2376696Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2376785Z power management: 2025-05-07T19:42:58.2376789Z 2025-05-07T19:42:58.2376880Z processor : 39 2025-05-07T19:42:58.2376963Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2377048Z cpu family : 6 2025-05-07T19:42:58.2377121Z model : 85 2025-05-07T19:42:58.2377283Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2377360Z stepping : 7 2025-05-07T19:42:58.2377440Z microcode : 0x5003901 2025-05-07T19:42:58.2377528Z cpu MHz : 2999.994 2025-05-07T19:42:58.2377609Z cache size : 36608 KB 2025-05-07T19:42:58.2377688Z physical id : 1 2025-05-07T19:42:58.2377762Z siblings : 48 2025-05-07T19:42:58.2377850Z core id : 15 2025-05-07T19:42:58.2377926Z cpu cores : 24 2025-05-07T19:42:58.2377999Z apicid : 94 2025-05-07T19:42:58.2378098Z initial apicid : 94 2025-05-07T19:42:58.2378172Z fpu : yes 2025-05-07T19:42:58.2378252Z fpu_exception : yes 2025-05-07T19:42:58.2378331Z cpuid level : 13 2025-05-07T19:42:58.2378420Z wp : yes 2025-05-07T19:42:58.2380420Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2380827Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2380920Z bogomips : 5999.98 2025-05-07T19:42:58.2380999Z clflush size : 64 2025-05-07T19:42:58.2381083Z cache_alignment : 64 2025-05-07T19:42:58.2381220Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2381347Z power management: 2025-05-07T19:42:58.2381351Z 2025-05-07T19:42:58.2381431Z processor : 40 2025-05-07T19:42:58.2381532Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2381609Z cpu family : 6 2025-05-07T19:42:58.2381688Z model : 85 2025-05-07T19:42:58.2381837Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2381929Z stepping : 7 2025-05-07T19:42:58.2382012Z microcode : 0x5003901 2025-05-07T19:42:58.2382094Z cpu MHz : 2999.994 2025-05-07T19:42:58.2382188Z cache size : 36608 KB 2025-05-07T19:42:58.2382266Z physical id : 1 2025-05-07T19:42:58.2382343Z siblings : 48 2025-05-07T19:42:58.2382417Z core id : 16 2025-05-07T19:42:58.2382507Z cpu cores : 24 2025-05-07T19:42:58.2382585Z apicid : 96 2025-05-07T19:42:58.2382665Z initial apicid : 96 2025-05-07T19:42:58.2382737Z fpu : yes 2025-05-07T19:42:58.2382835Z fpu_exception : yes 2025-05-07T19:42:58.2382914Z cpuid level : 13 2025-05-07T19:42:58.2382988Z wp : yes 2025-05-07T19:42:58.2384993Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2385353Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2385434Z bogomips : 5999.98 2025-05-07T19:42:58.2385528Z clflush size : 64 2025-05-07T19:42:58.2385606Z cache_alignment : 64 2025-05-07T19:42:58.2385725Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2385824Z power management: 2025-05-07T19:42:58.2385828Z 2025-05-07T19:42:58.2385906Z processor : 41 2025-05-07T19:42:58.2385990Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2386076Z cpu family : 6 2025-05-07T19:42:58.2386148Z model : 85 2025-05-07T19:42:58.2386293Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2386373Z stepping : 7 2025-05-07T19:42:58.2386466Z microcode : 0x5003901 2025-05-07T19:42:58.2386541Z cpu MHz : 1337.363 2025-05-07T19:42:58.2386619Z cache size : 36608 KB 2025-05-07T19:42:58.2386696Z physical id : 1 2025-05-07T19:42:58.2386784Z siblings : 48 2025-05-07T19:42:58.2386860Z core id : 17 2025-05-07T19:42:58.2386939Z cpu cores : 24 2025-05-07T19:42:58.2387024Z apicid : 98 2025-05-07T19:42:58.2387102Z initial apicid : 98 2025-05-07T19:42:58.2387174Z fpu : yes 2025-05-07T19:42:58.2387253Z fpu_exception : yes 2025-05-07T19:42:58.2387341Z cpuid level : 13 2025-05-07T19:42:58.2387418Z wp : yes 2025-05-07T19:42:58.2389401Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2389844Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2389923Z bogomips : 5999.98 2025-05-07T19:42:58.2390003Z clflush size : 64 2025-05-07T19:42:58.2390098Z cache_alignment : 64 2025-05-07T19:42:58.2390221Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2390303Z power management: 2025-05-07T19:42:58.2390307Z 2025-05-07T19:42:58.2390398Z processor : 42 2025-05-07T19:42:58.2390528Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2390605Z cpu family : 6 2025-05-07T19:42:58.2390679Z model : 85 2025-05-07T19:42:58.2390842Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2390922Z stepping : 7 2025-05-07T19:42:58.2391001Z microcode : 0x5003901 2025-05-07T19:42:58.2391095Z cpu MHz : 2999.994 2025-05-07T19:42:58.2391172Z cache size : 36608 KB 2025-05-07T19:42:58.2391249Z physical id : 1 2025-05-07T19:42:58.2391326Z siblings : 48 2025-05-07T19:42:58.2391413Z core id : 18 2025-05-07T19:42:58.2391487Z cpu cores : 24 2025-05-07T19:42:58.2391563Z apicid : 100 2025-05-07T19:42:58.2391656Z initial apicid : 100 2025-05-07T19:42:58.2391729Z fpu : yes 2025-05-07T19:42:58.2391810Z fpu_exception : yes 2025-05-07T19:42:58.2391887Z cpuid level : 13 2025-05-07T19:42:58.2391971Z wp : yes 2025-05-07T19:42:58.2394024Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2394597Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2394683Z bogomips : 5999.98 2025-05-07T19:42:58.2394770Z clflush size : 64 2025-05-07T19:42:58.2394935Z cache_alignment : 64 2025-05-07T19:42:58.2395080Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2395169Z power management: 2025-05-07T19:42:58.2395173Z 2025-05-07T19:42:58.2395256Z processor : 43 2025-05-07T19:42:58.2395363Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2395450Z cpu family : 6 2025-05-07T19:42:58.2395535Z model : 85 2025-05-07T19:42:58.2395696Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2395793Z stepping : 7 2025-05-07T19:42:58.2395882Z microcode : 0x5003901 2025-05-07T19:42:58.2395963Z cpu MHz : 1597.789 2025-05-07T19:42:58.2396059Z cache size : 36608 KB 2025-05-07T19:42:58.2396146Z physical id : 1 2025-05-07T19:42:58.2396227Z siblings : 48 2025-05-07T19:42:58.2396305Z core id : 19 2025-05-07T19:42:58.2396399Z cpu cores : 24 2025-05-07T19:42:58.2396478Z apicid : 102 2025-05-07T19:42:58.2396563Z initial apicid : 102 2025-05-07T19:42:58.2396657Z fpu : yes 2025-05-07T19:42:58.2396742Z fpu_exception : yes 2025-05-07T19:42:58.2396823Z cpuid level : 13 2025-05-07T19:42:58.2396901Z wp : yes 2025-05-07T19:42:58.2399063Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2399522Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2399621Z bogomips : 5999.98 2025-05-07T19:42:58.2399705Z clflush size : 64 2025-05-07T19:42:58.2399790Z cache_alignment : 64 2025-05-07T19:42:58.2399919Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2400016Z power management: 2025-05-07T19:42:58.2400021Z 2025-05-07T19:42:58.2400102Z processor : 44 2025-05-07T19:42:58.2400194Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2400290Z cpu family : 6 2025-05-07T19:42:58.2400416Z model : 85 2025-05-07T19:42:58.2400576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2400657Z stepping : 7 2025-05-07T19:42:58.2400760Z microcode : 0x5003901 2025-05-07T19:42:58.2400847Z cpu MHz : 1323.051 2025-05-07T19:42:58.2400931Z cache size : 36608 KB 2025-05-07T19:42:58.2401025Z physical id : 1 2025-05-07T19:42:58.2401106Z siblings : 48 2025-05-07T19:42:58.2401184Z core id : 20 2025-05-07T19:42:58.2401263Z cpu cores : 24 2025-05-07T19:42:58.2401358Z apicid : 104 2025-05-07T19:42:58.2401448Z initial apicid : 104 2025-05-07T19:42:58.2401527Z fpu : yes 2025-05-07T19:42:58.2401630Z fpu_exception : yes 2025-05-07T19:42:58.2401711Z cpuid level : 13 2025-05-07T19:42:58.2401788Z wp : yes 2025-05-07T19:42:58.2403953Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2404344Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2404427Z bogomips : 5999.98 2025-05-07T19:42:58.2404524Z clflush size : 64 2025-05-07T19:42:58.2404609Z cache_alignment : 64 2025-05-07T19:42:58.2404737Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2404822Z power management: 2025-05-07T19:42:58.2404826Z 2025-05-07T19:42:58.2404928Z processor : 45 2025-05-07T19:42:58.2405016Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2405097Z cpu family : 6 2025-05-07T19:42:58.2405190Z model : 85 2025-05-07T19:42:58.2405353Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2405434Z stepping : 7 2025-05-07T19:42:58.2405525Z microcode : 0x5003901 2025-05-07T19:42:58.2405618Z cpu MHz : 1314.677 2025-05-07T19:42:58.2405704Z cache size : 36608 KB 2025-05-07T19:42:58.2405787Z physical id : 1 2025-05-07T19:42:58.2405882Z siblings : 48 2025-05-07T19:42:58.2405961Z core id : 21 2025-05-07T19:42:58.2406044Z cpu cores : 24 2025-05-07T19:42:58.2406124Z apicid : 106 2025-05-07T19:42:58.2406332Z initial apicid : 106 2025-05-07T19:42:58.2406406Z fpu : yes 2025-05-07T19:42:58.2406486Z fpu_exception : yes 2025-05-07T19:42:58.2406580Z cpuid level : 13 2025-05-07T19:42:58.2406657Z wp : yes 2025-05-07T19:42:58.2408643Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2409061Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2409143Z bogomips : 5999.98 2025-05-07T19:42:58.2409220Z clflush size : 64 2025-05-07T19:42:58.2409316Z cache_alignment : 64 2025-05-07T19:42:58.2409436Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2409515Z power management: 2025-05-07T19:42:58.2409519Z 2025-05-07T19:42:58.2409595Z processor : 46 2025-05-07T19:42:58.2409690Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2409765Z cpu family : 6 2025-05-07T19:42:58.2409836Z model : 85 2025-05-07T19:42:58.2410048Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2410125Z stepping : 7 2025-05-07T19:42:58.2410204Z microcode : 0x5003901 2025-05-07T19:42:58.2410279Z cpu MHz : 2999.994 2025-05-07T19:42:58.2410368Z cache size : 36608 KB 2025-05-07T19:42:58.2410448Z physical id : 1 2025-05-07T19:42:58.2410523Z siblings : 48 2025-05-07T19:42:58.2410614Z core id : 22 2025-05-07T19:42:58.2410688Z cpu cores : 24 2025-05-07T19:42:58.2410762Z apicid : 108 2025-05-07T19:42:58.2410843Z initial apicid : 108 2025-05-07T19:42:58.2410930Z fpu : yes 2025-05-07T19:42:58.2411011Z fpu_exception : yes 2025-05-07T19:42:58.2411092Z cpuid level : 13 2025-05-07T19:42:58.2411166Z wp : yes 2025-05-07T19:42:58.2413171Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2413533Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2413626Z bogomips : 5999.98 2025-05-07T19:42:58.2413708Z clflush size : 64 2025-05-07T19:42:58.2413794Z cache_alignment : 64 2025-05-07T19:42:58.2413929Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2414009Z power management: 2025-05-07T19:42:58.2414013Z 2025-05-07T19:42:58.2414090Z processor : 47 2025-05-07T19:42:58.2414178Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2414270Z cpu family : 6 2025-05-07T19:42:58.2414347Z model : 85 2025-05-07T19:42:58.2414496Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2414589Z stepping : 7 2025-05-07T19:42:58.2414669Z microcode : 0x5003901 2025-05-07T19:42:58.2414744Z cpu MHz : 2999.994 2025-05-07T19:42:58.2414824Z cache size : 36608 KB 2025-05-07T19:42:58.2414919Z physical id : 1 2025-05-07T19:42:58.2414999Z siblings : 48 2025-05-07T19:42:58.2415074Z core id : 23 2025-05-07T19:42:58.2415166Z cpu cores : 24 2025-05-07T19:42:58.2415243Z apicid : 110 2025-05-07T19:42:58.2415325Z initial apicid : 110 2025-05-07T19:42:58.2415401Z fpu : yes 2025-05-07T19:42:58.2415497Z fpu_exception : yes 2025-05-07T19:42:58.2415579Z cpuid level : 13 2025-05-07T19:42:58.2415653Z wp : yes 2025-05-07T19:42:58.2417655Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2418057Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2418136Z bogomips : 5999.98 2025-05-07T19:42:58.2418228Z clflush size : 64 2025-05-07T19:42:58.2418309Z cache_alignment : 64 2025-05-07T19:42:58.2418427Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2418508Z power management: 2025-05-07T19:42:58.2418526Z 2025-05-07T19:42:58.2418605Z processor : 48 2025-05-07T19:42:58.2418691Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2418773Z cpu family : 6 2025-05-07T19:42:58.2418864Z model : 85 2025-05-07T19:42:58.2419011Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2419088Z stepping : 7 2025-05-07T19:42:58.2419226Z microcode : 0x5003901 2025-05-07T19:42:58.2419305Z cpu MHz : 3150.575 2025-05-07T19:42:58.2419390Z cache size : 36608 KB 2025-05-07T19:42:58.2419470Z physical id : 0 2025-05-07T19:42:58.2419562Z siblings : 48 2025-05-07T19:42:58.2419640Z core id : 0 2025-05-07T19:42:58.2419714Z cpu cores : 24 2025-05-07T19:42:58.2419786Z apicid : 1 2025-05-07T19:42:58.2419881Z initial apicid : 1 2025-05-07T19:42:58.2419954Z fpu : yes 2025-05-07T19:42:58.2420032Z fpu_exception : yes 2025-05-07T19:42:58.2420120Z cpuid level : 13 2025-05-07T19:42:58.2420192Z wp : yes 2025-05-07T19:42:58.2422176Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2422546Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2422627Z bogomips : 5999.98 2025-05-07T19:42:58.2422705Z clflush size : 64 2025-05-07T19:42:58.2422797Z cache_alignment : 64 2025-05-07T19:42:58.2422916Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2422994Z power management: 2025-05-07T19:42:58.2422998Z 2025-05-07T19:42:58.2423090Z processor : 49 2025-05-07T19:42:58.2423173Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2423247Z cpu family : 6 2025-05-07T19:42:58.2423319Z model : 85 2025-05-07T19:42:58.2438175Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2438316Z stepping : 7 2025-05-07T19:42:58.2438406Z microcode : 0x5003901 2025-05-07T19:42:58.2438492Z cpu MHz : 3245.338 2025-05-07T19:42:58.2438589Z cache size : 36608 KB 2025-05-07T19:42:58.2438682Z physical id : 0 2025-05-07T19:42:58.2438759Z siblings : 48 2025-05-07T19:42:58.2438837Z core id : 1 2025-05-07T19:42:58.2438923Z cpu cores : 24 2025-05-07T19:42:58.2439007Z apicid : 3 2025-05-07T19:42:58.2439088Z initial apicid : 3 2025-05-07T19:42:58.2439164Z fpu : yes 2025-05-07T19:42:58.2439257Z fpu_exception : yes 2025-05-07T19:42:58.2439337Z cpuid level : 13 2025-05-07T19:42:58.2439411Z wp : yes 2025-05-07T19:42:58.2441590Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2441979Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2442207Z bogomips : 5999.98 2025-05-07T19:42:58.2442298Z clflush size : 64 2025-05-07T19:42:58.2442380Z cache_alignment : 64 2025-05-07T19:42:58.2442510Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2442607Z power management: 2025-05-07T19:42:58.2442614Z 2025-05-07T19:42:58.2442695Z processor : 50 2025-05-07T19:42:58.2442784Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2442861Z cpu family : 6 2025-05-07T19:42:58.2442947Z model : 85 2025-05-07T19:42:58.2443107Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2443190Z stepping : 7 2025-05-07T19:42:58.2443284Z microcode : 0x5003901 2025-05-07T19:42:58.2443361Z cpu MHz : 3252.514 2025-05-07T19:42:58.2443440Z cache size : 36608 KB 2025-05-07T19:42:58.2443591Z physical id : 0 2025-05-07T19:42:58.2443683Z siblings : 48 2025-05-07T19:42:58.2443761Z core id : 2 2025-05-07T19:42:58.2443837Z cpu cores : 24 2025-05-07T19:42:58.2443913Z apicid : 5 2025-05-07T19:42:58.2444009Z initial apicid : 5 2025-05-07T19:42:58.2444085Z fpu : yes 2025-05-07T19:42:58.2444167Z fpu_exception : yes 2025-05-07T19:42:58.2444254Z cpuid level : 13 2025-05-07T19:42:58.2444333Z wp : yes 2025-05-07T19:42:58.2446647Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2447010Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2447091Z bogomips : 5999.98 2025-05-07T19:42:58.2447166Z clflush size : 64 2025-05-07T19:42:58.2447252Z cache_alignment : 64 2025-05-07T19:42:58.2447376Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2447456Z power management: 2025-05-07T19:42:58.2447460Z 2025-05-07T19:42:58.2447543Z processor : 51 2025-05-07T19:42:58.2447621Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2447693Z cpu family : 6 2025-05-07T19:42:58.2447760Z model : 85 2025-05-07T19:42:58.2447917Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2447990Z stepping : 7 2025-05-07T19:42:58.2448068Z microcode : 0x5003901 2025-05-07T19:42:58.2448145Z cpu MHz : 3162.373 2025-05-07T19:42:58.2448217Z cache size : 36608 KB 2025-05-07T19:42:58.2448288Z physical id : 0 2025-05-07T19:42:58.2448361Z siblings : 48 2025-05-07T19:42:58.2448442Z core id : 3 2025-05-07T19:42:58.2448512Z cpu cores : 24 2025-05-07T19:42:58.2448585Z apicid : 7 2025-05-07T19:42:58.2448658Z initial apicid : 7 2025-05-07T19:42:58.2448739Z fpu : yes 2025-05-07T19:42:58.2448819Z fpu_exception : yes 2025-05-07T19:42:58.2448893Z cpuid level : 13 2025-05-07T19:42:58.2448969Z wp : yes 2025-05-07T19:42:58.2450951Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2451306Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2451387Z bogomips : 5999.98 2025-05-07T19:42:58.2451524Z clflush size : 64 2025-05-07T19:42:58.2451601Z cache_alignment : 64 2025-05-07T19:42:58.2451722Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2451794Z power management: 2025-05-07T19:42:58.2451798Z 2025-05-07T19:42:58.2451868Z processor : 52 2025-05-07T19:42:58.2451948Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2452033Z cpu family : 6 2025-05-07T19:42:58.2452102Z model : 85 2025-05-07T19:42:58.2452246Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2452333Z stepping : 7 2025-05-07T19:42:58.2452414Z microcode : 0x5003901 2025-05-07T19:42:58.2452490Z cpu MHz : 3354.198 2025-05-07T19:42:58.2452567Z cache size : 36608 KB 2025-05-07T19:42:58.2452655Z physical id : 0 2025-05-07T19:42:58.2452731Z siblings : 48 2025-05-07T19:42:58.2452850Z core id : 4 2025-05-07T19:42:58.2452934Z cpu cores : 24 2025-05-07T19:42:58.2453009Z apicid : 9 2025-05-07T19:42:58.2453086Z initial apicid : 9 2025-05-07T19:42:58.2453159Z fpu : yes 2025-05-07T19:42:58.2453252Z fpu_exception : yes 2025-05-07T19:42:58.2453329Z cpuid level : 13 2025-05-07T19:42:58.2453402Z wp : yes 2025-05-07T19:42:58.2455392Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2455753Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2455835Z bogomips : 5999.98 2025-05-07T19:42:58.2455923Z clflush size : 64 2025-05-07T19:42:58.2456007Z cache_alignment : 64 2025-05-07T19:42:58.2456131Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2456219Z power management: 2025-05-07T19:42:58.2456224Z 2025-05-07T19:42:58.2456302Z processor : 53 2025-05-07T19:42:58.2456388Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2456464Z cpu family : 6 2025-05-07T19:42:58.2456544Z model : 85 2025-05-07T19:42:58.2456692Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2456770Z stepping : 7 2025-05-07T19:42:58.2456859Z microcode : 0x5003901 2025-05-07T19:42:58.2456934Z cpu MHz : 3262.241 2025-05-07T19:42:58.2457015Z cache size : 36608 KB 2025-05-07T19:42:58.2457095Z physical id : 0 2025-05-07T19:42:58.2457179Z siblings : 48 2025-05-07T19:42:58.2457252Z core id : 5 2025-05-07T19:42:58.2457331Z cpu cores : 24 2025-05-07T19:42:58.2457412Z apicid : 11 2025-05-07T19:42:58.2457491Z initial apicid : 11 2025-05-07T19:42:58.2457564Z fpu : yes 2025-05-07T19:42:58.2457643Z fpu_exception : yes 2025-05-07T19:42:58.2457726Z cpuid level : 13 2025-05-07T19:42:58.2457802Z wp : yes 2025-05-07T19:42:58.2459789Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2460155Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2460230Z bogomips : 5999.98 2025-05-07T19:42:58.2460303Z clflush size : 64 2025-05-07T19:42:58.2460389Z cache_alignment : 64 2025-05-07T19:42:58.2460508Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2460632Z power management: 2025-05-07T19:42:58.2460636Z 2025-05-07T19:42:58.2460722Z processor : 54 2025-05-07T19:42:58.2460800Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2460871Z cpu family : 6 2025-05-07T19:42:58.2460942Z model : 85 2025-05-07T19:42:58.2461094Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2461166Z stepping : 7 2025-05-07T19:42:58.2461247Z microcode : 0x5003901 2025-05-07T19:42:58.2461330Z cpu MHz : 3238.297 2025-05-07T19:42:58.2461404Z cache size : 36608 KB 2025-05-07T19:42:58.2461477Z physical id : 0 2025-05-07T19:42:58.2461548Z siblings : 48 2025-05-07T19:42:58.2461627Z core id : 6 2025-05-07T19:42:58.2461701Z cpu cores : 24 2025-05-07T19:42:58.2461770Z apicid : 13 2025-05-07T19:42:58.2461898Z initial apicid : 13 2025-05-07T19:42:58.2461969Z fpu : yes 2025-05-07T19:42:58.2462044Z fpu_exception : yes 2025-05-07T19:42:58.2462115Z cpuid level : 13 2025-05-07T19:42:58.2462194Z wp : yes 2025-05-07T19:42:58.2464175Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2464535Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2464613Z bogomips : 5999.98 2025-05-07T19:42:58.2464686Z clflush size : 64 2025-05-07T19:42:58.2464765Z cache_alignment : 64 2025-05-07T19:42:58.2464886Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2464967Z power management: 2025-05-07T19:42:58.2464972Z 2025-05-07T19:42:58.2465046Z processor : 55 2025-05-07T19:42:58.2465130Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2465199Z cpu family : 6 2025-05-07T19:42:58.2465270Z model : 85 2025-05-07T19:42:58.2465412Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2465492Z stepping : 7 2025-05-07T19:42:58.2465568Z microcode : 0x5003901 2025-05-07T19:42:58.2465639Z cpu MHz : 3265.141 2025-05-07T19:42:58.2465718Z cache size : 36608 KB 2025-05-07T19:42:58.2465794Z physical id : 0 2025-05-07T19:42:58.2465865Z siblings : 48 2025-05-07T19:42:58.2465940Z core id : 7 2025-05-07T19:42:58.2466023Z cpu cores : 24 2025-05-07T19:42:58.2466093Z apicid : 15 2025-05-07T19:42:58.2466166Z initial apicid : 15 2025-05-07T19:42:58.2466245Z fpu : yes 2025-05-07T19:42:58.2466323Z fpu_exception : yes 2025-05-07T19:42:58.2466401Z cpuid level : 13 2025-05-07T19:42:58.2466466Z wp : yes 2025-05-07T19:42:58.2468454Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2468813Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2468892Z bogomips : 5999.98 2025-05-07T19:42:58.2468967Z clflush size : 64 2025-05-07T19:42:58.2469050Z cache_alignment : 64 2025-05-07T19:42:58.2469164Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2469247Z power management: 2025-05-07T19:42:58.2469294Z 2025-05-07T19:42:58.2469369Z processor : 56 2025-05-07T19:42:58.2469446Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2469522Z cpu family : 6 2025-05-07T19:42:58.2469591Z model : 85 2025-05-07T19:42:58.2469732Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2469804Z stepping : 7 2025-05-07T19:42:58.2469883Z microcode : 0x5003901 2025-05-07T19:42:58.2469952Z cpu MHz : 3285.537 2025-05-07T19:42:58.2470027Z cache size : 36608 KB 2025-05-07T19:42:58.2470108Z physical id : 0 2025-05-07T19:42:58.2470177Z siblings : 48 2025-05-07T19:42:58.2470246Z core id : 8 2025-05-07T19:42:58.2470317Z cpu cores : 24 2025-05-07T19:42:58.2470396Z apicid : 17 2025-05-07T19:42:58.2470469Z initial apicid : 17 2025-05-07T19:42:58.2470539Z fpu : yes 2025-05-07T19:42:58.2471110Z fpu_exception : yes 2025-05-07T19:42:58.2471190Z cpuid level : 13 2025-05-07T19:42:58.2471260Z wp : yes 2025-05-07T19:42:58.2473240Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2473604Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2473758Z bogomips : 5999.98 2025-05-07T19:42:58.2473836Z clflush size : 64 2025-05-07T19:42:58.2473924Z cache_alignment : 64 2025-05-07T19:42:58.2474040Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2474281Z power management: 2025-05-07T19:42:58.2474286Z 2025-05-07T19:42:58.2474374Z processor : 57 2025-05-07T19:42:58.2474463Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2474542Z cpu family : 6 2025-05-07T19:42:58.2474624Z model : 85 2025-05-07T19:42:58.2474781Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2474857Z stepping : 7 2025-05-07T19:42:58.2474936Z microcode : 0x5003901 2025-05-07T19:42:58.2475051Z cpu MHz : 2999.994 2025-05-07T19:42:58.2475135Z cache size : 36608 KB 2025-05-07T19:42:58.2475213Z physical id : 0 2025-05-07T19:42:58.2475289Z siblings : 48 2025-05-07T19:42:58.2475372Z core id : 9 2025-05-07T19:42:58.2475446Z cpu cores : 24 2025-05-07T19:42:58.2475521Z apicid : 19 2025-05-07T19:42:58.2475614Z initial apicid : 19 2025-05-07T19:42:58.2475686Z fpu : yes 2025-05-07T19:42:58.2475766Z fpu_exception : yes 2025-05-07T19:42:58.2475849Z cpuid level : 13 2025-05-07T19:42:58.2475931Z wp : yes 2025-05-07T19:42:58.2478076Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2478468Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2478548Z bogomips : 5999.98 2025-05-07T19:42:58.2478627Z clflush size : 64 2025-05-07T19:42:58.2478709Z cache_alignment : 64 2025-05-07T19:42:58.2478847Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2478927Z power management: 2025-05-07T19:42:58.2478932Z 2025-05-07T19:42:58.2479010Z processor : 58 2025-05-07T19:42:58.2479103Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2479238Z cpu family : 6 2025-05-07T19:42:58.2479311Z model : 85 2025-05-07T19:42:58.2479466Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2479551Z stepping : 7 2025-05-07T19:42:58.2479633Z microcode : 0x5003901 2025-05-07T19:42:58.2479710Z cpu MHz : 2999.994 2025-05-07T19:42:58.2479796Z cache size : 36608 KB 2025-05-07T19:42:58.2479875Z physical id : 0 2025-05-07T19:42:58.2479954Z siblings : 48 2025-05-07T19:42:58.2480028Z core id : 10 2025-05-07T19:42:58.2480116Z cpu cores : 24 2025-05-07T19:42:58.2480196Z apicid : 21 2025-05-07T19:42:58.2480276Z initial apicid : 21 2025-05-07T19:42:58.2480359Z fpu : yes 2025-05-07T19:42:58.2480440Z fpu_exception : yes 2025-05-07T19:42:58.2480518Z cpuid level : 13 2025-05-07T19:42:58.2480651Z wp : yes 2025-05-07T19:42:58.2482807Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2483192Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2483281Z bogomips : 5999.98 2025-05-07T19:42:58.2483359Z clflush size : 64 2025-05-07T19:42:58.2483441Z cache_alignment : 64 2025-05-07T19:42:58.2483565Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2483663Z power management: 2025-05-07T19:42:58.2483667Z 2025-05-07T19:42:58.2483746Z processor : 59 2025-05-07T19:42:58.2483834Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2483923Z cpu family : 6 2025-05-07T19:42:58.2484000Z model : 85 2025-05-07T19:42:58.2484156Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2484237Z stepping : 7 2025-05-07T19:42:58.2484329Z microcode : 0x5003901 2025-05-07T19:42:58.2484405Z cpu MHz : 2999.994 2025-05-07T19:42:58.2484485Z cache size : 36608 KB 2025-05-07T19:42:58.2484575Z physical id : 0 2025-05-07T19:42:58.2484650Z siblings : 48 2025-05-07T19:42:58.2484729Z core id : 11 2025-05-07T19:42:58.2484806Z cpu cores : 24 2025-05-07T19:42:58.2484891Z apicid : 23 2025-05-07T19:42:58.2484970Z initial apicid : 23 2025-05-07T19:42:58.2485042Z fpu : yes 2025-05-07T19:42:58.2485134Z fpu_exception : yes 2025-05-07T19:42:58.2485215Z cpuid level : 13 2025-05-07T19:42:58.2485289Z wp : yes 2025-05-07T19:42:58.2487462Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2487817Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2487893Z bogomips : 5999.98 2025-05-07T19:42:58.2487974Z clflush size : 64 2025-05-07T19:42:58.2488053Z cache_alignment : 64 2025-05-07T19:42:58.2488170Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2488244Z power management: 2025-05-07T19:42:58.2488252Z 2025-05-07T19:42:58.2488336Z processor : 60 2025-05-07T19:42:58.2488415Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2488484Z cpu family : 6 2025-05-07T19:42:58.2488558Z model : 85 2025-05-07T19:42:58.2488755Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2488829Z stepping : 7 2025-05-07T19:42:58.2488903Z microcode : 0x5003901 2025-05-07T19:42:58.2488980Z cpu MHz : 3225.318 2025-05-07T19:42:58.2489054Z cache size : 36608 KB 2025-05-07T19:42:58.2489126Z physical id : 0 2025-05-07T19:42:58.2489206Z siblings : 48 2025-05-07T19:42:58.2489273Z core id : 12 2025-05-07T19:42:58.2489344Z cpu cores : 24 2025-05-07T19:42:58.2489415Z apicid : 25 2025-05-07T19:42:58.2489495Z initial apicid : 25 2025-05-07T19:42:58.2489562Z fpu : yes 2025-05-07T19:42:58.2489638Z fpu_exception : yes 2025-05-07T19:42:58.2489722Z cpuid level : 13 2025-05-07T19:42:58.2489790Z wp : yes 2025-05-07T19:42:58.2491814Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2492177Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2492251Z bogomips : 5999.98 2025-05-07T19:42:58.2492325Z clflush size : 64 2025-05-07T19:42:58.2492410Z cache_alignment : 64 2025-05-07T19:42:58.2492525Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2492606Z power management: 2025-05-07T19:42:58.2492610Z 2025-05-07T19:42:58.2492686Z processor : 61 2025-05-07T19:42:58.2492772Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2492846Z cpu family : 6 2025-05-07T19:42:58.2492916Z model : 85 2025-05-07T19:42:58.2493068Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2493142Z stepping : 7 2025-05-07T19:42:58.2493216Z microcode : 0x5003901 2025-05-07T19:42:58.2493288Z cpu MHz : 3236.009 2025-05-07T19:42:58.2493369Z cache size : 36608 KB 2025-05-07T19:42:58.2493440Z physical id : 0 2025-05-07T19:42:58.2493511Z siblings : 48 2025-05-07T19:42:58.2493583Z core id : 13 2025-05-07T19:42:58.2493660Z cpu cores : 24 2025-05-07T19:42:58.2493730Z apicid : 27 2025-05-07T19:42:58.2493808Z initial apicid : 27 2025-05-07T19:42:58.2493884Z fpu : yes 2025-05-07T19:42:58.2493957Z fpu_exception : yes 2025-05-07T19:42:58.2494026Z cpuid level : 13 2025-05-07T19:42:58.2494094Z wp : yes 2025-05-07T19:42:58.2496085Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2496441Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2496528Z bogomips : 5999.98 2025-05-07T19:42:58.2496600Z clflush size : 64 2025-05-07T19:42:58.2496675Z cache_alignment : 64 2025-05-07T19:42:58.2496791Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2496870Z power management: 2025-05-07T19:42:58.2496875Z 2025-05-07T19:42:58.2496948Z processor : 62 2025-05-07T19:42:58.2497037Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2497113Z cpu family : 6 2025-05-07T19:42:58.2497180Z model : 85 2025-05-07T19:42:58.2497321Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2497445Z stepping : 7 2025-05-07T19:42:58.2497525Z microcode : 0x5003901 2025-05-07T19:42:58.2497596Z cpu MHz : 2999.994 2025-05-07T19:42:58.2497668Z cache size : 36608 KB 2025-05-07T19:42:58.2497748Z physical id : 0 2025-05-07T19:42:58.2497818Z siblings : 48 2025-05-07T19:42:58.2497888Z core id : 14 2025-05-07T19:42:58.2497960Z cpu cores : 24 2025-05-07T19:42:58.2498038Z apicid : 29 2025-05-07T19:42:58.2498112Z initial apicid : 29 2025-05-07T19:42:58.2498184Z fpu : yes 2025-05-07T19:42:58.2498263Z fpu_exception : yes 2025-05-07T19:42:58.2498334Z cpuid level : 13 2025-05-07T19:42:58.2498403Z wp : yes 2025-05-07T19:42:58.2500449Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2500806Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2500882Z bogomips : 5999.98 2025-05-07T19:42:58.2500964Z clflush size : 64 2025-05-07T19:42:58.2501036Z cache_alignment : 64 2025-05-07T19:42:58.2501150Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2501227Z power management: 2025-05-07T19:42:58.2501230Z 2025-05-07T19:42:58.2501309Z processor : 63 2025-05-07T19:42:58.2501389Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2501461Z cpu family : 6 2025-05-07T19:42:58.2501537Z model : 85 2025-05-07T19:42:58.2501677Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2501750Z stepping : 7 2025-05-07T19:42:58.2501825Z microcode : 0x5003901 2025-05-07T19:42:58.2501906Z cpu MHz : 2999.994 2025-05-07T19:42:58.2501979Z cache size : 36608 KB 2025-05-07T19:42:58.2502051Z physical id : 0 2025-05-07T19:42:58.2502130Z siblings : 48 2025-05-07T19:42:58.2502196Z core id : 15 2025-05-07T19:42:58.2502269Z cpu cores : 24 2025-05-07T19:42:58.2502338Z apicid : 31 2025-05-07T19:42:58.2502417Z initial apicid : 31 2025-05-07T19:42:58.2502486Z fpu : yes 2025-05-07T19:42:58.2502561Z fpu_exception : yes 2025-05-07T19:42:58.2502640Z cpuid level : 13 2025-05-07T19:42:58.2502707Z wp : yes 2025-05-07T19:42:58.2504693Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2505049Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2505121Z bogomips : 5999.98 2025-05-07T19:42:58.2505189Z clflush size : 64 2025-05-07T19:42:58.2505268Z cache_alignment : 64 2025-05-07T19:42:58.2505381Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2505452Z power management: 2025-05-07T19:42:58.2505455Z 2025-05-07T19:42:58.2505524Z processor : 64 2025-05-07T19:42:58.2505606Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2505672Z cpu family : 6 2025-05-07T19:42:58.2505737Z model : 85 2025-05-07T19:42:58.2505887Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2505957Z stepping : 7 2025-05-07T19:42:58.2506028Z microcode : 0x5003901 2025-05-07T19:42:58.2506098Z cpu MHz : 2999.994 2025-05-07T19:42:58.2506222Z cache size : 36608 KB 2025-05-07T19:42:58.2506294Z physical id : 0 2025-05-07T19:42:58.2506366Z siblings : 48 2025-05-07T19:42:58.2506445Z core id : 16 2025-05-07T19:42:58.2506514Z cpu cores : 24 2025-05-07T19:42:58.2506582Z apicid : 33 2025-05-07T19:42:58.2506657Z initial apicid : 33 2025-05-07T19:42:58.2506733Z fpu : yes 2025-05-07T19:42:58.2506811Z fpu_exception : yes 2025-05-07T19:42:58.2506886Z cpuid level : 13 2025-05-07T19:42:58.2506959Z wp : yes 2025-05-07T19:42:58.2508979Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2509329Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2509409Z bogomips : 5999.98 2025-05-07T19:42:58.2509483Z clflush size : 64 2025-05-07T19:42:58.2509557Z cache_alignment : 64 2025-05-07T19:42:58.2509679Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2509750Z power management: 2025-05-07T19:42:58.2509754Z 2025-05-07T19:42:58.2509823Z processor : 65 2025-05-07T19:42:58.2509902Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2509977Z cpu family : 6 2025-05-07T19:42:58.2510050Z model : 85 2025-05-07T19:42:58.2510195Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2510272Z stepping : 7 2025-05-07T19:42:58.2510347Z microcode : 0x5003901 2025-05-07T19:42:58.2510417Z cpu MHz : 2999.994 2025-05-07T19:42:58.2510492Z cache size : 36608 KB 2025-05-07T19:42:58.2510580Z physical id : 0 2025-05-07T19:42:58.2510648Z siblings : 48 2025-05-07T19:42:58.2510723Z core id : 17 2025-05-07T19:42:58.2510800Z cpu cores : 24 2025-05-07T19:42:58.2510868Z apicid : 35 2025-05-07T19:42:58.2510946Z initial apicid : 35 2025-05-07T19:42:58.2511020Z fpu : yes 2025-05-07T19:42:58.2511102Z fpu_exception : yes 2025-05-07T19:42:58.2511171Z cpuid level : 13 2025-05-07T19:42:58.2511238Z wp : yes 2025-05-07T19:42:58.2513231Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2513583Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2513729Z bogomips : 5999.98 2025-05-07T19:42:58.2513816Z clflush size : 64 2025-05-07T19:42:58.2513892Z cache_alignment : 64 2025-05-07T19:42:58.2514010Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2514259Z power management: 2025-05-07T19:42:58.2514263Z 2025-05-07T19:42:58.2514341Z processor : 66 2025-05-07T19:42:58.2514427Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2514504Z cpu family : 6 2025-05-07T19:42:58.2514584Z model : 85 2025-05-07T19:42:58.2514735Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2514812Z stepping : 7 2025-05-07T19:42:58.2514901Z microcode : 0x5003901 2025-05-07T19:42:58.2514980Z cpu MHz : 2999.994 2025-05-07T19:42:58.2515060Z cache size : 36608 KB 2025-05-07T19:42:58.2515141Z physical id : 0 2025-05-07T19:42:58.2515274Z siblings : 48 2025-05-07T19:42:58.2515350Z core id : 18 2025-05-07T19:42:58.2515428Z cpu cores : 24 2025-05-07T19:42:58.2515516Z apicid : 37 2025-05-07T19:42:58.2515600Z initial apicid : 37 2025-05-07T19:42:58.2515671Z fpu : yes 2025-05-07T19:42:58.2515755Z fpu_exception : yes 2025-05-07T19:42:58.2515840Z cpuid level : 13 2025-05-07T19:42:58.2515913Z wp : yes 2025-05-07T19:42:58.2518119Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2518511Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2518593Z bogomips : 5999.98 2025-05-07T19:42:58.2518672Z clflush size : 64 2025-05-07T19:42:58.2518764Z cache_alignment : 64 2025-05-07T19:42:58.2518889Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2518968Z power management: 2025-05-07T19:42:58.2518972Z 2025-05-07T19:42:58.2519063Z processor : 67 2025-05-07T19:42:58.2519145Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2519218Z cpu family : 6 2025-05-07T19:42:58.2519288Z model : 85 2025-05-07T19:42:58.2519449Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2519525Z stepping : 7 2025-05-07T19:42:58.2519613Z microcode : 0x5003901 2025-05-07T19:42:58.2519697Z cpu MHz : 2999.994 2025-05-07T19:42:58.2519774Z cache size : 36608 KB 2025-05-07T19:42:58.2519854Z physical id : 0 2025-05-07T19:42:58.2519928Z siblings : 48 2025-05-07T19:42:58.2520012Z core id : 19 2025-05-07T19:42:58.2520090Z cpu cores : 24 2025-05-07T19:42:58.2520166Z apicid : 39 2025-05-07T19:42:58.2520248Z initial apicid : 39 2025-05-07T19:42:58.2520325Z fpu : yes 2025-05-07T19:42:58.2520407Z fpu_exception : yes 2025-05-07T19:42:58.2520486Z cpuid level : 13 2025-05-07T19:42:58.2520567Z wp : yes 2025-05-07T19:42:58.2522715Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2523096Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2523185Z bogomips : 5999.98 2025-05-07T19:42:58.2523261Z clflush size : 64 2025-05-07T19:42:58.2523341Z cache_alignment : 64 2025-05-07T19:42:58.2523472Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2523550Z power management: 2025-05-07T19:42:58.2523554Z 2025-05-07T19:42:58.2523631Z processor : 68 2025-05-07T19:42:58.2523723Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2523797Z cpu family : 6 2025-05-07T19:42:58.2523868Z model : 85 2025-05-07T19:42:58.2524023Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2524108Z stepping : 7 2025-05-07T19:42:58.2524187Z microcode : 0x5003901 2025-05-07T19:42:58.2524265Z cpu MHz : 3382.567 2025-05-07T19:42:58.2524345Z cache size : 36608 KB 2025-05-07T19:42:58.2524429Z physical id : 0 2025-05-07T19:42:58.2524502Z siblings : 48 2025-05-07T19:42:58.2524578Z core id : 20 2025-05-07T19:42:58.2524706Z cpu cores : 24 2025-05-07T19:42:58.2524779Z apicid : 41 2025-05-07T19:42:58.2524859Z initial apicid : 41 2025-05-07T19:42:58.2524934Z fpu : yes 2025-05-07T19:42:58.2525018Z fpu_exception : yes 2025-05-07T19:42:58.2525096Z cpuid level : 13 2025-05-07T19:42:58.2525166Z wp : yes 2025-05-07T19:42:58.2527392Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2527741Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2527820Z bogomips : 5999.98 2025-05-07T19:42:58.2527898Z clflush size : 64 2025-05-07T19:42:58.2527977Z cache_alignment : 64 2025-05-07T19:42:58.2528092Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2528175Z power management: 2025-05-07T19:42:58.2528179Z 2025-05-07T19:42:58.2528246Z processor : 69 2025-05-07T19:42:58.2528490Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2528562Z cpu family : 6 2025-05-07T19:42:58.2528804Z model : 85 2025-05-07T19:42:58.2528958Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2529034Z stepping : 7 2025-05-07T19:42:58.2529122Z microcode : 0x5003901 2025-05-07T19:42:58.2529198Z cpu MHz : 3251.217 2025-05-07T19:42:58.2529379Z cache size : 36608 KB 2025-05-07T19:42:58.2529460Z physical id : 0 2025-05-07T19:42:58.2529540Z siblings : 48 2025-05-07T19:42:58.2529612Z core id : 21 2025-05-07T19:42:58.2529687Z cpu cores : 24 2025-05-07T19:42:58.2529771Z apicid : 43 2025-05-07T19:42:58.2529906Z initial apicid : 43 2025-05-07T19:42:58.2529980Z fpu : yes 2025-05-07T19:42:58.2530065Z fpu_exception : yes 2025-05-07T19:42:58.2530149Z cpuid level : 13 2025-05-07T19:42:58.2530222Z wp : yes 2025-05-07T19:42:58.2532380Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2532763Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2532848Z bogomips : 5999.98 2025-05-07T19:42:58.2532932Z clflush size : 64 2025-05-07T19:42:58.2533019Z cache_alignment : 64 2025-05-07T19:42:58.2533140Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2533220Z power management: 2025-05-07T19:42:58.2533224Z 2025-05-07T19:42:58.2533307Z processor : 70 2025-05-07T19:42:58.2533390Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2533472Z cpu family : 6 2025-05-07T19:42:58.2533543Z model : 85 2025-05-07T19:42:58.2533702Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2533777Z stepping : 7 2025-05-07T19:42:58.2533859Z microcode : 0x5003901 2025-05-07T19:42:58.2533938Z cpu MHz : 3171.247 2025-05-07T19:42:58.2534018Z cache size : 36608 KB 2025-05-07T19:42:58.2534100Z physical id : 0 2025-05-07T19:42:58.2534177Z siblings : 48 2025-05-07T19:42:58.2534253Z core id : 22 2025-05-07T19:42:58.2534325Z cpu cores : 24 2025-05-07T19:42:58.2534396Z apicid : 45 2025-05-07T19:42:58.2534484Z initial apicid : 45 2025-05-07T19:42:58.2534672Z fpu : yes 2025-05-07T19:42:58.2534752Z fpu_exception : yes 2025-05-07T19:42:58.2534826Z cpuid level : 13 2025-05-07T19:42:58.2534906Z wp : yes 2025-05-07T19:42:58.2537111Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2537495Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2537573Z bogomips : 5999.98 2025-05-07T19:42:58.2537654Z clflush size : 64 2025-05-07T19:42:58.2537733Z cache_alignment : 64 2025-05-07T19:42:58.2537861Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2537938Z power management: 2025-05-07T19:42:58.2537943Z 2025-05-07T19:42:58.2538019Z processor : 71 2025-05-07T19:42:58.2538105Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2538179Z cpu family : 6 2025-05-07T19:42:58.2538250Z model : 85 2025-05-07T19:42:58.2538405Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2538500Z stepping : 7 2025-05-07T19:42:58.2538585Z microcode : 0x5003901 2025-05-07T19:42:58.2538671Z cpu MHz : 3544.715 2025-05-07T19:42:58.2538770Z cache size : 36608 KB 2025-05-07T19:42:58.2538853Z physical id : 0 2025-05-07T19:42:58.2538934Z siblings : 48 2025-05-07T19:42:58.2539016Z core id : 23 2025-05-07T19:42:58.2539110Z cpu cores : 24 2025-05-07T19:42:58.2539195Z apicid : 47 2025-05-07T19:42:58.2539279Z initial apicid : 47 2025-05-07T19:42:58.2539448Z fpu : yes 2025-05-07T19:42:58.2539540Z fpu_exception : yes 2025-05-07T19:42:58.2539622Z cpuid level : 13 2025-05-07T19:42:58.2539703Z wp : yes 2025-05-07T19:42:58.2541943Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2542302Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2542393Z bogomips : 5999.98 2025-05-07T19:42:58.2542470Z clflush size : 64 2025-05-07T19:42:58.2542554Z cache_alignment : 64 2025-05-07T19:42:58.2542674Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2542770Z power management: 2025-05-07T19:42:58.2542775Z 2025-05-07T19:42:58.2542851Z processor : 72 2025-05-07T19:42:58.2542936Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2543025Z cpu family : 6 2025-05-07T19:42:58.2543096Z model : 85 2025-05-07T19:42:58.2543243Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2543320Z stepping : 7 2025-05-07T19:42:58.2543617Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:42:58.2543701Z microcode : 0x5003901 2025-05-07T19:42:58.2543778Z cpu MHz : 2999.994 2025-05-07T19:42:58.2543873Z cache size : 36608 KB 2025-05-07T19:42:58.2543957Z physical id : 1 2025-05-07T19:42:58.2544038Z siblings : 48 2025-05-07T19:42:58.2544113Z core id : 0 2025-05-07T19:42:58.2544202Z cpu cores : 24 2025-05-07T19:42:58.2544277Z apicid : 65 2025-05-07T19:42:58.2544356Z initial apicid : 65 2025-05-07T19:42:58.2544494Z fpu : yes 2025-05-07T19:42:58.2544578Z fpu_exception : yes 2025-05-07T19:42:58.2544655Z cpuid level : 13 2025-05-07T19:42:58.2544728Z wp : yes 2025-05-07T19:42:58.2546728Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.2547172Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2547271Z bogomips : 5999.98 2025-05-07T19:42:58.2547354Z clflush size : 64 2025-05-07T19:42:58.2547436Z cache_alignment : 64 2025-05-07T19:42:58.2547556Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2547654Z power management: 2025-05-07T19:42:58.2547658Z 2025-05-07T19:42:58.2547735Z processor : 73 2025-05-07T19:42:58.2547823Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2547913Z cpu family : 6 2025-05-07T19:42:58.2547987Z model : 85 2025-05-07T19:42:58.2548135Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2548212Z stepping : 7 2025-05-07T19:42:58.2548308Z microcode : 0x5003901 2025-05-07T19:42:58.2548386Z cpu MHz : 2999.994 2025-05-07T19:42:58.2548464Z cache size : 36608 KB 2025-05-07T19:42:58.2548557Z physical id : 1 2025-05-07T19:42:58.2548633Z siblings : 48 2025-05-07T19:42:58.2548711Z core id : 1 2025-05-07T19:42:58.2548788Z cpu cores : 24 2025-05-07T19:42:58.2548876Z apicid : 67 2025-05-07T19:42:58.2548959Z initial apicid : 67 2025-05-07T19:42:58.2549032Z fpu : yes 2025-05-07T19:42:58.2549128Z fpu_exception : yes 2025-05-07T19:42:58.2549205Z cpuid level : 13 2025-05-07T19:42:58.2549279Z wp : yes 2025-05-07T19:42:58.2551278Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.2551641Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2551720Z bogomips : 5999.98 2025-05-07T19:42:58.2551811Z clflush size : 64 2025-05-07T19:42:58.2551895Z cache_alignment : 64 2025-05-07T19:42:58.2552015Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2552094Z power management: 2025-05-07T19:42:58.2552098Z 2025-05-07T19:42:58.2552194Z processor : 74 2025-05-07T19:42:58.2552455Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2552531Z cpu family : 6 2025-05-07T19:42:58.2552791Z model : 85 2025-05-07T19:42:58.2552950Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2553153Z stepping : 7 2025-05-07T19:42:58.2553240Z microcode : 0x5003901 2025-05-07T19:42:58.2553338Z cpu MHz : 2999.994 2025-05-07T19:42:58.2553422Z cache size : 36608 KB 2025-05-07T19:42:58.2553556Z physical id : 1 2025-05-07T19:42:58.2553692Z siblings : 48 2025-05-07T19:42:58.2553784Z core id : 2 2025-05-07T19:42:58.2553870Z cpu cores : 24 2025-05-07T19:42:58.2553949Z apicid : 69 2025-05-07T19:42:58.2554048Z initial apicid : 69 2025-05-07T19:42:58.2554126Z fpu : yes 2025-05-07T19:42:58.2554211Z fpu_exception : yes 2025-05-07T19:42:58.2554377Z cpuid level : 13 2025-05-07T19:42:58.2554457Z wp : yes 2025-05-07T19:42:58.2556605Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.2557051Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2557137Z bogomips : 5999.98 2025-05-07T19:42:58.2557220Z clflush size : 64 2025-05-07T19:42:58.2557320Z cache_alignment : 64 2025-05-07T19:42:58.2557451Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2557539Z power management: 2025-05-07T19:42:58.2557544Z 2025-05-07T19:42:58.2557628Z processor : 75 2025-05-07T19:42:58.2557735Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2557817Z cpu family : 6 2025-05-07T19:42:58.2557894Z model : 85 2025-05-07T19:42:58.2558068Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2558155Z stepping : 7 2025-05-07T19:42:58.2558240Z microcode : 0x5003901 2025-05-07T19:42:58.2558324Z cpu MHz : 2999.994 2025-05-07T19:42:58.2558425Z cache size : 36608 KB 2025-05-07T19:42:58.2558508Z physical id : 1 2025-05-07T19:42:58.2558589Z siblings : 48 2025-05-07T19:42:58.2558688Z core id : 3 2025-05-07T19:42:58.2558769Z cpu cores : 24 2025-05-07T19:42:58.2558850Z apicid : 71 2025-05-07T19:42:58.2558938Z initial apicid : 71 2025-05-07T19:42:58.2559028Z fpu : yes 2025-05-07T19:42:58.2559113Z fpu_exception : yes 2025-05-07T19:42:58.2559196Z cpuid level : 13 2025-05-07T19:42:58.2559276Z wp : yes 2025-05-07T19:42:58.2561442Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.2561826Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2561928Z bogomips : 5999.98 2025-05-07T19:42:58.2562012Z clflush size : 64 2025-05-07T19:42:58.2562101Z cache_alignment : 64 2025-05-07T19:42:58.2562254Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2562351Z power management: 2025-05-07T19:42:58.2562355Z 2025-05-07T19:42:58.2562440Z processor : 76 2025-05-07T19:42:58.2562523Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2562610Z cpu family : 6 2025-05-07T19:42:58.2562687Z model : 85 2025-05-07T19:42:58.2562837Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2562919Z stepping : 7 2025-05-07T19:42:58.2562999Z microcode : 0x5003901 2025-05-07T19:42:58.2563078Z cpu MHz : 2999.994 2025-05-07T19:42:58.2563155Z cache size : 36608 KB 2025-05-07T19:42:58.2563247Z physical id : 1 2025-05-07T19:42:58.2563320Z siblings : 48 2025-05-07T19:42:58.2563393Z core id : 4 2025-05-07T19:42:58.2563471Z cpu cores : 24 2025-05-07T19:42:58.2563544Z apicid : 73 2025-05-07T19:42:58.2563629Z initial apicid : 73 2025-05-07T19:42:58.2563708Z fpu : yes 2025-05-07T19:42:58.2563799Z fpu_exception : yes 2025-05-07T19:42:58.2563875Z cpuid level : 13 2025-05-07T19:42:58.2563945Z wp : yes 2025-05-07T19:42:58.2566097Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.2566604Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2566677Z bogomips : 5999.98 2025-05-07T19:42:58.2566802Z clflush size : 64 2025-05-07T19:42:58.2566876Z cache_alignment : 64 2025-05-07T19:42:58.2566992Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2567064Z power management: 2025-05-07T19:42:58.2567079Z 2025-05-07T19:42:58.2567150Z processor : 77 2025-05-07T19:42:58.2567230Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2567304Z cpu family : 6 2025-05-07T19:42:58.2567380Z model : 85 2025-05-07T19:42:58.2567526Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2567597Z stepping : 7 2025-05-07T19:42:58.2567677Z microcode : 0x5003901 2025-05-07T19:42:58.2567748Z cpu MHz : 2999.994 2025-05-07T19:42:58.2567823Z cache size : 36608 KB 2025-05-07T19:42:58.2567893Z physical id : 1 2025-05-07T19:42:58.2567968Z siblings : 48 2025-05-07T19:42:58.2568034Z core id : 5 2025-05-07T19:42:58.2568106Z cpu cores : 24 2025-05-07T19:42:58.2568174Z apicid : 75 2025-05-07T19:42:58.2568254Z initial apicid : 75 2025-05-07T19:42:58.2568322Z fpu : yes 2025-05-07T19:42:58.2568400Z fpu_exception : yes 2025-05-07T19:42:58.2568478Z cpuid level : 13 2025-05-07T19:42:58.2568547Z wp : yes 2025-05-07T19:42:58.2570528Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:42:58.2570894Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2570969Z bogomips : 5999.98 2025-05-07T19:42:58.2571040Z clflush size : 64 2025-05-07T19:42:58.2571126Z cache_alignment : 64 2025-05-07T19:42:58.2571237Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2571311Z power management: 2025-05-07T19:42:58.2571315Z 2025-05-07T19:42:58.2571398Z processor : 78 2025-05-07T19:42:58.2571480Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2571549Z cpu family : 6 2025-05-07T19:42:58.2571618Z model : 85 2025-05-07T19:42:58.2571769Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2571837Z stepping : 7 2025-05-07T19:42:58.2571912Z microcode : 0x5003901 2025-05-07T19:42:58.2571983Z cpu MHz : 2999.994 2025-05-07T19:42:58.2572064Z cache size : 36608 KB 2025-05-07T19:42:58.2572135Z physical id : 1 2025-05-07T19:42:58.2572202Z siblings : 48 2025-05-07T19:42:58.2572279Z core id : 6 2025-05-07T19:42:58.2572350Z cpu cores : 24 2025-05-07T19:42:58.2572421Z apicid : 77 2025-05-07T19:42:58.2572495Z initial apicid : 77 2025-05-07T19:42:58.2572568Z fpu : yes 2025-05-07T19:42:58.2572644Z fpu_exception : yes 2025-05-07T19:42:58.2572716Z cpuid level : 13 2025-05-07T19:42:58.2572796Z wp : yes 2025-05-07T19:42:58.2574778Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:42:58.2575174Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2575253Z bogomips : 5999.98 2025-05-07T19:42:58.2575324Z clflush size : 64 2025-05-07T19:42:58.2575395Z cache_alignment : 64 2025-05-07T19:42:58.2575561Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2575637Z power management: 2025-05-07T19:42:58.2575641Z 2025-05-07T19:42:58.2575709Z processor : 79 2025-05-07T19:42:58.2575788Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2575867Z cpu family : 6 2025-05-07T19:42:58.2575932Z model : 85 2025-05-07T19:42:58.2576070Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2576146Z stepping : 7 2025-05-07T19:42:58.2576219Z microcode : 0x5003901 2025-05-07T19:42:58.2576285Z cpu MHz : 1503.345 2025-05-07T19:42:58.2576356Z cache size : 36608 KB 2025-05-07T19:42:58.2576432Z physical id : 1 2025-05-07T19:42:58.2576502Z siblings : 48 2025-05-07T19:42:58.2576570Z core id : 7 2025-05-07T19:42:58.2576647Z cpu cores : 24 2025-05-07T19:42:58.2576716Z apicid : 79 2025-05-07T19:42:58.2576788Z initial apicid : 79 2025-05-07T19:42:58.2576853Z fpu : yes 2025-05-07T19:42:58.2576931Z fpu_exception : yes 2025-05-07T19:42:58.2577002Z cpuid level : 13 2025-05-07T19:42:58.2577074Z wp : yes 2025-05-07T19:42:58.2579061Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:42:58.2579415Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2579489Z bogomips : 5999.98 2025-05-07T19:42:58.2579565Z clflush size : 64 2025-05-07T19:42:58.2579641Z cache_alignment : 64 2025-05-07T19:42:58.2579755Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2579834Z power management: 2025-05-07T19:42:58.2579839Z 2025-05-07T19:42:58.2579908Z processor : 80 2025-05-07T19:42:58.2579988Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2580056Z cpu family : 6 2025-05-07T19:42:58.2580137Z model : 85 2025-05-07T19:42:58.2580281Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2580350Z stepping : 7 2025-05-07T19:42:58.2580434Z microcode : 0x5003901 2025-05-07T19:42:58.2580503Z cpu MHz : 2999.994 2025-05-07T19:42:58.2580575Z cache size : 36608 KB 2025-05-07T19:42:58.2580645Z physical id : 1 2025-05-07T19:42:58.2580726Z siblings : 48 2025-05-07T19:42:58.2580792Z core id : 8 2025-05-07T19:42:58.2580861Z cpu cores : 24 2025-05-07T19:42:58.2580936Z apicid : 81 2025-05-07T19:42:58.2581007Z initial apicid : 81 2025-05-07T19:42:58.2581073Z fpu : yes 2025-05-07T19:42:58.2581149Z fpu_exception : yes 2025-05-07T19:42:58.2581225Z cpuid level : 13 2025-05-07T19:42:58.2581291Z wp : yes 2025-05-07T19:42:58.2583271Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2583671Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2583744Z bogomips : 5999.98 2025-05-07T19:42:58.2583818Z clflush size : 64 2025-05-07T19:42:58.2583899Z cache_alignment : 64 2025-05-07T19:42:58.2584013Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2584085Z power management: 2025-05-07T19:42:58.2584135Z 2025-05-07T19:42:58.2584211Z processor : 81 2025-05-07T19:42:58.2584292Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2584360Z cpu family : 6 2025-05-07T19:42:58.2584427Z model : 85 2025-05-07T19:42:58.2584576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2584643Z stepping : 7 2025-05-07T19:42:58.2584717Z microcode : 0x5003901 2025-05-07T19:42:58.2584790Z cpu MHz : 1346.779 2025-05-07T19:42:58.2584860Z cache size : 36608 KB 2025-05-07T19:42:58.2584929Z physical id : 1 2025-05-07T19:42:58.2584997Z siblings : 48 2025-05-07T19:42:58.2585069Z core id : 9 2025-05-07T19:42:58.2585136Z cpu cores : 24 2025-05-07T19:42:58.2585203Z apicid : 83 2025-05-07T19:42:58.2585283Z initial apicid : 83 2025-05-07T19:42:58.2585354Z fpu : yes 2025-05-07T19:42:58.2585428Z fpu_exception : yes 2025-05-07T19:42:58.2585498Z cpuid level : 13 2025-05-07T19:42:58.2585572Z wp : yes 2025-05-07T19:42:58.2587556Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2587915Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2587993Z bogomips : 5999.98 2025-05-07T19:42:58.2588063Z clflush size : 64 2025-05-07T19:42:58.2588137Z cache_alignment : 64 2025-05-07T19:42:58.2588256Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2588330Z power management: 2025-05-07T19:42:58.2588334Z 2025-05-07T19:42:58.2588407Z processor : 82 2025-05-07T19:42:58.2588498Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2588567Z cpu family : 6 2025-05-07T19:42:58.2588635Z model : 85 2025-05-07T19:42:58.2588773Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2588850Z stepping : 7 2025-05-07T19:42:58.2588923Z microcode : 0x5003901 2025-05-07T19:42:58.2588995Z cpu MHz : 2999.994 2025-05-07T19:42:58.2589071Z cache size : 36608 KB 2025-05-07T19:42:58.2589140Z physical id : 1 2025-05-07T19:42:58.2589212Z siblings : 48 2025-05-07T19:42:58.2589280Z core id : 10 2025-05-07T19:42:58.2589354Z cpu cores : 24 2025-05-07T19:42:58.2589422Z apicid : 85 2025-05-07T19:42:58.2589498Z initial apicid : 85 2025-05-07T19:42:58.2589564Z fpu : yes 2025-05-07T19:42:58.2589644Z fpu_exception : yes 2025-05-07T19:42:58.2589715Z cpuid level : 13 2025-05-07T19:42:58.2589782Z wp : yes 2025-05-07T19:42:58.2591772Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2592412Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2592485Z bogomips : 5999.98 2025-05-07T19:42:58.2592564Z clflush size : 64 2025-05-07T19:42:58.2592638Z cache_alignment : 64 2025-05-07T19:42:58.2592752Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2592836Z power management: 2025-05-07T19:42:58.2592840Z 2025-05-07T19:42:58.2592908Z processor : 83 2025-05-07T19:42:58.2593034Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2593112Z cpu family : 6 2025-05-07T19:42:58.2593179Z model : 85 2025-05-07T19:42:58.2593320Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2593399Z stepping : 7 2025-05-07T19:42:58.2593492Z microcode : 0x5003901 2025-05-07T19:42:58.2593565Z cpu MHz : 1311.094 2025-05-07T19:42:58.2593698Z cache size : 36608 KB 2025-05-07T19:42:58.2593791Z physical id : 1 2025-05-07T19:42:58.2593863Z siblings : 48 2025-05-07T19:42:58.2593935Z core id : 11 2025-05-07T19:42:58.2594020Z cpu cores : 24 2025-05-07T19:42:58.2594265Z apicid : 87 2025-05-07T19:42:58.2594348Z initial apicid : 87 2025-05-07T19:42:58.2594427Z fpu : yes 2025-05-07T19:42:58.2594507Z fpu_exception : yes 2025-05-07T19:42:58.2594600Z cpuid level : 13 2025-05-07T19:42:58.2594680Z wp : yes 2025-05-07T19:42:58.2596830Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2597226Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2597304Z bogomips : 5999.98 2025-05-07T19:42:58.2597382Z clflush size : 64 2025-05-07T19:42:58.2597471Z cache_alignment : 64 2025-05-07T19:42:58.2597597Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2597676Z power management: 2025-05-07T19:42:58.2597680Z 2025-05-07T19:42:58.2597774Z processor : 84 2025-05-07T19:42:58.2597860Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2597937Z cpu family : 6 2025-05-07T19:42:58.2598009Z model : 85 2025-05-07T19:42:58.2598171Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2598244Z stepping : 7 2025-05-07T19:42:58.2598320Z microcode : 0x5003901 2025-05-07T19:42:58.2598415Z cpu MHz : 2999.994 2025-05-07T19:42:58.2598494Z cache size : 36608 KB 2025-05-07T19:42:58.2598569Z physical id : 1 2025-05-07T19:42:58.2598650Z siblings : 48 2025-05-07T19:42:58.2598728Z core id : 12 2025-05-07T19:42:58.2598804Z cpu cores : 24 2025-05-07T19:42:58.2598874Z apicid : 89 2025-05-07T19:42:58.2598964Z initial apicid : 89 2025-05-07T19:42:58.2599044Z fpu : yes 2025-05-07T19:42:58.2599125Z fpu_exception : yes 2025-05-07T19:42:58.2599205Z cpuid level : 13 2025-05-07T19:42:58.2599298Z wp : yes 2025-05-07T19:42:58.2601448Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2601888Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2601966Z bogomips : 5999.98 2025-05-07T19:42:58.2602049Z clflush size : 64 2025-05-07T19:42:58.2602135Z cache_alignment : 64 2025-05-07T19:42:58.2602267Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2602347Z power management: 2025-05-07T19:42:58.2602351Z 2025-05-07T19:42:58.2602427Z processor : 85 2025-05-07T19:42:58.2602522Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2602599Z cpu family : 6 2025-05-07T19:42:58.2602678Z model : 85 2025-05-07T19:42:58.2602879Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2602974Z stepping : 7 2025-05-07T19:42:58.2603058Z microcode : 0x5003901 2025-05-07T19:42:58.2603132Z cpu MHz : 1295.718 2025-05-07T19:42:58.2603222Z cache size : 36608 KB 2025-05-07T19:42:58.2603303Z physical id : 1 2025-05-07T19:42:58.2603381Z siblings : 48 2025-05-07T19:42:58.2603454Z core id : 13 2025-05-07T19:42:58.2603537Z cpu cores : 24 2025-05-07T19:42:58.2603608Z apicid : 91 2025-05-07T19:42:58.2603685Z initial apicid : 91 2025-05-07T19:42:58.2603767Z fpu : yes 2025-05-07T19:42:58.2603845Z fpu_exception : yes 2025-05-07T19:42:58.2603921Z cpuid level : 13 2025-05-07T19:42:58.2603992Z wp : yes 2025-05-07T19:42:58.2606140Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2606609Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2606687Z bogomips : 5999.98 2025-05-07T19:42:58.2606757Z clflush size : 64 2025-05-07T19:42:58.2606830Z cache_alignment : 64 2025-05-07T19:42:58.2606944Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2607021Z power management: 2025-05-07T19:42:58.2607025Z 2025-05-07T19:42:58.2607094Z processor : 86 2025-05-07T19:42:58.2607172Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2607246Z cpu family : 6 2025-05-07T19:42:58.2607312Z model : 85 2025-05-07T19:42:58.2607455Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2607523Z stepping : 7 2025-05-07T19:42:58.2607605Z microcode : 0x5003901 2025-05-07T19:42:58.2607673Z cpu MHz : 1312.552 2025-05-07T19:42:58.2607747Z cache size : 36608 KB 2025-05-07T19:42:58.2607829Z physical id : 1 2025-05-07T19:42:58.2607895Z siblings : 48 2025-05-07T19:42:58.2607961Z core id : 14 2025-05-07T19:42:58.2608028Z cpu cores : 24 2025-05-07T19:42:58.2608101Z apicid : 93 2025-05-07T19:42:58.2608176Z initial apicid : 93 2025-05-07T19:42:58.2608244Z fpu : yes 2025-05-07T19:42:58.2608324Z fpu_exception : yes 2025-05-07T19:42:58.2608393Z cpuid level : 13 2025-05-07T19:42:58.2608457Z wp : yes 2025-05-07T19:42:58.2610447Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2610850Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2610920Z bogomips : 5999.98 2025-05-07T19:42:58.2610995Z clflush size : 64 2025-05-07T19:42:58.2611069Z cache_alignment : 64 2025-05-07T19:42:58.2611180Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2611252Z power management: 2025-05-07T19:42:58.2611256Z 2025-05-07T19:42:58.2611335Z processor : 87 2025-05-07T19:42:58.2611409Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2611477Z cpu family : 6 2025-05-07T19:42:58.2611549Z model : 85 2025-05-07T19:42:58.2611689Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2611802Z stepping : 7 2025-05-07T19:42:58.2611878Z microcode : 0x5003901 2025-05-07T19:42:58.2611955Z cpu MHz : 1445.928 2025-05-07T19:42:58.2612025Z cache size : 36608 KB 2025-05-07T19:42:58.2612096Z physical id : 1 2025-05-07T19:42:58.2612174Z siblings : 48 2025-05-07T19:42:58.2612239Z core id : 15 2025-05-07T19:42:58.2612308Z cpu cores : 24 2025-05-07T19:42:58.2612377Z apicid : 95 2025-05-07T19:42:58.2612459Z initial apicid : 95 2025-05-07T19:42:58.2612523Z fpu : yes 2025-05-07T19:42:58.2612595Z fpu_exception : yes 2025-05-07T19:42:58.2612671Z cpuid level : 13 2025-05-07T19:42:58.2612739Z wp : yes 2025-05-07T19:42:58.2614723Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2615080Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2615155Z bogomips : 5999.98 2025-05-07T19:42:58.2615226Z clflush size : 64 2025-05-07T19:42:58.2615304Z cache_alignment : 64 2025-05-07T19:42:58.2615419Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2615498Z power management: 2025-05-07T19:42:58.2615502Z 2025-05-07T19:42:58.2615572Z processor : 88 2025-05-07T19:42:58.2615656Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2615726Z cpu family : 6 2025-05-07T19:42:58.2615792Z model : 85 2025-05-07T19:42:58.2615939Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2616012Z stepping : 7 2025-05-07T19:42:58.2616086Z microcode : 0x5003901 2025-05-07T19:42:58.2616156Z cpu MHz : 1279.297 2025-05-07T19:42:58.2616235Z cache size : 36608 KB 2025-05-07T19:42:58.2616303Z physical id : 1 2025-05-07T19:42:58.2616372Z siblings : 48 2025-05-07T19:42:58.2616448Z core id : 16 2025-05-07T19:42:58.2616514Z cpu cores : 24 2025-05-07T19:42:58.2616580Z apicid : 97 2025-05-07T19:42:58.2616653Z initial apicid : 97 2025-05-07T19:42:58.2616724Z fpu : yes 2025-05-07T19:42:58.2616796Z fpu_exception : yes 2025-05-07T19:42:58.2616866Z cpuid level : 13 2025-05-07T19:42:58.2616931Z wp : yes 2025-05-07T19:42:58.2618930Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2619281Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2619413Z bogomips : 5999.98 2025-05-07T19:42:58.2619490Z clflush size : 64 2025-05-07T19:42:58.2619566Z cache_alignment : 64 2025-05-07T19:42:58.2619688Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2619766Z power management: 2025-05-07T19:42:58.2619770Z 2025-05-07T19:42:58.2619845Z processor : 89 2025-05-07T19:42:58.2619928Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2620007Z cpu family : 6 2025-05-07T19:42:58.2620076Z model : 85 2025-05-07T19:42:58.2620218Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2620301Z stepping : 7 2025-05-07T19:42:58.2620379Z microcode : 0x5003901 2025-05-07T19:42:58.2620507Z cpu MHz : 1530.743 2025-05-07T19:42:58.2620585Z cache size : 36608 KB 2025-05-07T19:42:58.2620666Z physical id : 1 2025-05-07T19:42:58.2620739Z siblings : 48 2025-05-07T19:42:58.2620811Z core id : 17 2025-05-07T19:42:58.2620887Z cpu cores : 24 2025-05-07T19:42:58.2620969Z apicid : 99 2025-05-07T19:42:58.2621046Z initial apicid : 99 2025-05-07T19:42:58.2621116Z fpu : yes 2025-05-07T19:42:58.2621206Z fpu_exception : yes 2025-05-07T19:42:58.2621281Z cpuid level : 13 2025-05-07T19:42:58.2621354Z wp : yes 2025-05-07T19:42:58.2623348Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2623708Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2623788Z bogomips : 5999.98 2025-05-07T19:42:58.2623874Z clflush size : 64 2025-05-07T19:42:58.2623953Z cache_alignment : 64 2025-05-07T19:42:58.2624064Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2624138Z power management: 2025-05-07T19:42:58.2624149Z 2025-05-07T19:42:58.2624220Z processor : 90 2025-05-07T19:42:58.2624297Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2624366Z cpu family : 6 2025-05-07T19:42:58.2624438Z model : 85 2025-05-07T19:42:58.2624577Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2624646Z stepping : 7 2025-05-07T19:42:58.2624724Z microcode : 0x5003901 2025-05-07T19:42:58.2624794Z cpu MHz : 2283.213 2025-05-07T19:42:58.2624871Z cache size : 36608 KB 2025-05-07T19:42:58.2624941Z physical id : 1 2025-05-07T19:42:58.2625017Z siblings : 48 2025-05-07T19:42:58.2625082Z core id : 18 2025-05-07T19:42:58.2625151Z cpu cores : 24 2025-05-07T19:42:58.2625223Z apicid : 101 2025-05-07T19:42:58.2625304Z initial apicid : 101 2025-05-07T19:42:58.2625371Z fpu : yes 2025-05-07T19:42:58.2625442Z fpu_exception : yes 2025-05-07T19:42:58.2625520Z cpuid level : 13 2025-05-07T19:42:58.2625584Z wp : yes 2025-05-07T19:42:58.2627567Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2627922Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2628041Z bogomips : 5999.98 2025-05-07T19:42:58.2628111Z clflush size : 64 2025-05-07T19:42:58.2628192Z cache_alignment : 64 2025-05-07T19:42:58.2628427Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2628501Z power management: 2025-05-07T19:42:58.2628505Z 2025-05-07T19:42:58.2628584Z processor : 91 2025-05-07T19:42:58.2628829Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2628904Z cpu family : 6 2025-05-07T19:42:58.2628976Z model : 85 2025-05-07T19:42:58.2629141Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2629218Z stepping : 7 2025-05-07T19:42:58.2629390Z microcode : 0x5003901 2025-05-07T19:42:58.2629470Z cpu MHz : 1505.099 2025-05-07T19:42:58.2629553Z cache size : 36608 KB 2025-05-07T19:42:58.2629715Z physical id : 1 2025-05-07T19:42:58.2629863Z siblings : 48 2025-05-07T19:42:58.2629946Z core id : 19 2025-05-07T19:42:58.2630022Z cpu cores : 24 2025-05-07T19:42:58.2630096Z apicid : 103 2025-05-07T19:42:58.2630180Z initial apicid : 103 2025-05-07T19:42:58.2630261Z fpu : yes 2025-05-07T19:42:58.2630340Z fpu_exception : yes 2025-05-07T19:42:58.2630417Z cpuid level : 13 2025-05-07T19:42:58.2630494Z wp : yes 2025-05-07T19:42:58.2632643Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2633022Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2633111Z bogomips : 5999.98 2025-05-07T19:42:58.2633190Z clflush size : 64 2025-05-07T19:42:58.2633271Z cache_alignment : 64 2025-05-07T19:42:58.2633398Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2633481Z power management: 2025-05-07T19:42:58.2633485Z 2025-05-07T19:42:58.2633559Z processor : 92 2025-05-07T19:42:58.2633694Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2633784Z cpu family : 6 2025-05-07T19:42:58.2633856Z model : 85 2025-05-07T19:42:58.2634008Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2634094Z stepping : 7 2025-05-07T19:42:58.2634176Z microcode : 0x5003901 2025-05-07T19:42:58.2634250Z cpu MHz : 2999.994 2025-05-07T19:42:58.2634327Z cache size : 36608 KB 2025-05-07T19:42:58.2634410Z physical id : 1 2025-05-07T19:42:58.2634548Z siblings : 48 2025-05-07T19:42:58.2634622Z core id : 20 2025-05-07T19:42:58.2634702Z cpu cores : 24 2025-05-07T19:42:58.2634775Z apicid : 105 2025-05-07T19:42:58.2634852Z initial apicid : 105 2025-05-07T19:42:58.2634928Z fpu : yes 2025-05-07T19:42:58.2635012Z fpu_exception : yes 2025-05-07T19:42:58.2635090Z cpuid level : 13 2025-05-07T19:42:58.2635166Z wp : yes 2025-05-07T19:42:58.2637319Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2637701Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2637782Z bogomips : 5999.98 2025-05-07T19:42:58.2637865Z clflush size : 64 2025-05-07T19:42:58.2638017Z cache_alignment : 64 2025-05-07T19:42:58.2638217Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2638304Z power management: 2025-05-07T19:42:58.2638309Z 2025-05-07T19:42:58.2638386Z processor : 93 2025-05-07T19:42:58.2638471Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2638543Z cpu family : 6 2025-05-07T19:42:58.2638619Z model : 85 2025-05-07T19:42:58.2638773Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2638848Z stepping : 7 2025-05-07T19:42:58.2638936Z microcode : 0x5003901 2025-05-07T19:42:58.2639009Z cpu MHz : 2999.994 2025-05-07T19:42:58.2639084Z cache size : 36608 KB 2025-05-07T19:42:58.2639162Z physical id : 1 2025-05-07T19:42:58.2639244Z siblings : 48 2025-05-07T19:42:58.2639315Z core id : 21 2025-05-07T19:42:58.2639435Z cpu cores : 24 2025-05-07T19:42:58.2639517Z apicid : 107 2025-05-07T19:42:58.2639594Z initial apicid : 107 2025-05-07T19:42:58.2639664Z fpu : yes 2025-05-07T19:42:58.2639746Z fpu_exception : yes 2025-05-07T19:42:58.2639830Z cpuid level : 13 2025-05-07T19:42:58.2639901Z wp : yes 2025-05-07T19:42:58.2642045Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2642432Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2642510Z bogomips : 5999.98 2025-05-07T19:42:58.2642587Z clflush size : 64 2025-05-07T19:42:58.2642670Z cache_alignment : 64 2025-05-07T19:42:58.2642793Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2642871Z power management: 2025-05-07T19:42:58.2642876Z 2025-05-07T19:42:58.2642956Z processor : 94 2025-05-07T19:42:58.2643039Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2643117Z cpu family : 6 2025-05-07T19:42:58.2643190Z model : 85 2025-05-07T19:42:58.2643350Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2643424Z stepping : 7 2025-05-07T19:42:58.2643504Z microcode : 0x5003901 2025-05-07T19:42:58.2643586Z cpu MHz : 1309.111 2025-05-07T19:42:58.2643663Z cache size : 36608 KB 2025-05-07T19:42:58.2643743Z physical id : 1 2025-05-07T19:42:58.2643816Z siblings : 48 2025-05-07T19:42:58.2643896Z core id : 22 2025-05-07T19:42:58.2643968Z cpu cores : 24 2025-05-07T19:42:58.2644044Z apicid : 109 2025-05-07T19:42:58.2644134Z initial apicid : 109 2025-05-07T19:42:58.2644208Z fpu : yes 2025-05-07T19:42:58.2644286Z fpu_exception : yes 2025-05-07T19:42:58.2644361Z cpuid level : 13 2025-05-07T19:42:58.2644440Z wp : yes 2025-05-07T19:42:58.2646714Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2647072Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2647151Z bogomips : 5999.98 2025-05-07T19:42:58.2647224Z clflush size : 64 2025-05-07T19:42:58.2647298Z cache_alignment : 64 2025-05-07T19:42:58.2647421Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2647542Z power management: 2025-05-07T19:42:58.2647546Z 2025-05-07T19:42:58.2647617Z processor : 95 2025-05-07T19:42:58.2647706Z vendor_id : GenuineIntel 2025-05-07T19:42:58.2647776Z cpu family : 6 2025-05-07T19:42:58.2647844Z model : 85 2025-05-07T19:42:58.2647983Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.2648060Z stepping : 7 2025-05-07T19:42:58.2648135Z microcode : 0x5003901 2025-05-07T19:42:58.2648207Z cpu MHz : 2999.994 2025-05-07T19:42:58.2648287Z cache size : 36608 KB 2025-05-07T19:42:58.2648358Z physical id : 1 2025-05-07T19:42:58.2648430Z siblings : 48 2025-05-07T19:42:58.2648500Z core id : 23 2025-05-07T19:42:58.2648574Z cpu cores : 24 2025-05-07T19:42:58.2648644Z apicid : 111 2025-05-07T19:42:58.2648775Z initial apicid : 111 2025-05-07T19:42:58.2648851Z fpu : yes 2025-05-07T19:42:58.2648926Z fpu_exception : yes 2025-05-07T19:42:58.2648994Z cpuid level : 13 2025-05-07T19:42:58.2649062Z wp : yes 2025-05-07T19:42:58.2651050Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:42:58.2651402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.2651484Z bogomips : 5999.98 2025-05-07T19:42:58.2651557Z clflush size : 64 2025-05-07T19:42:58.2651632Z cache_alignment : 64 2025-05-07T19:42:58.2651748Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.2651830Z power management: 2025-05-07T19:42:58.2651834Z 2025-05-07T19:42:58.2651838Z 2025-05-07T19:42:58.2651941Z ################################################################################ 2025-05-07T19:42:58.2652025Z [INFO] Print PCI info ... 2025-05-07T19:42:58.2652102Z + lspci -v 2025-05-07T19:42:58.2652107Z 2025-05-07T19:42:58.2652275Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:42:58.2652378Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:42:58.2652494Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:42:58.2652498Z 2025-05-07T19:42:58.2652860Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:42:58.2652948Z Physical Slot: 1 2025-05-07T19:42:58.2653156Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.2653166Z 2025-05-07T19:42:58.2653421Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:42:58.2653504Z Physical Slot: 1 2025-05-07T19:42:58.2653635Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:42:58.2653639Z 2025-05-07T19:42:58.2653918Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:42:58.2654000Z Physical Slot: 3 2025-05-07T19:42:58.2654108Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.2654255Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:42:58.2654379Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:42:58.2654383Z 2025-05-07T19:42:58.2654698Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:42:58.2654815Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:42:58.2654903Z Physical Slot: 4 2025-05-07T19:42:58.2655035Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:42:58.2655190Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:42:58.2655299Z Capabilities: 2025-05-07T19:42:58.2655442Z Kernel driver in use: nvme 2025-05-07T19:42:58.2655446Z 2025-05-07T19:42:58.2655664Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:42:58.2655759Z Physical Slot: 5 2025-05-07T19:42:58.2655867Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.2656019Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:42:58.2656157Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:42:58.2656305Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:42:58.2656405Z Capabilities: 2025-05-07T19:42:58.2656495Z Kernel driver in use: ena 2025-05-07T19:42:58.2656499Z 2025-05-07T19:42:58.2656511Z 2025-05-07T19:42:58.2656684Z ################################################################################ 2025-05-07T19:42:58.2656794Z [INFO] Print Linux distribution info ... 
2025-05-07T19:42:58.2656876Z + uname -a 2025-05-07T19:42:58.2656880Z 2025-05-07T19:42:58.2657280Z Linux 5cbac523ab1b 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:42:58.2657289Z 2025-05-07T19:42:58.2657366Z + uname -m 2025-05-07T19:42:58.2657371Z 2025-05-07T19:42:58.2657443Z x86_64 2025-05-07T19:42:58.2657447Z 2025-05-07T19:42:58.2657530Z + cat /proc/version 2025-05-07T19:42:58.2657534Z 2025-05-07T19:42:58.2658122Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:42:58.2658126Z 2025-05-07T19:42:58.2658206Z + cat /etc/os-release 2025-05-07T19:42:58.2658218Z 2025-05-07T19:42:58.2658295Z NAME="Amazon Linux" 2025-05-07T19:42:58.2658373Z VERSION="2023" 2025-05-07T19:42:58.2658450Z ID="amzn" 2025-05-07T19:42:58.2658533Z ID_LIKE="fedora" 2025-05-07T19:42:58.2658614Z VERSION_ID="2023" 2025-05-07T19:42:58.2658711Z PLATFORM_ID="platform:al2023" 2025-05-07T19:42:58.2658820Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:42:58.2658905Z ANSI_COLOR="0;33" 2025-05-07T19:42:58.2659021Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:42:58.2659202Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:42:58.2659373Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:42:58.2659525Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:42:58.2659714Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:42:58.2659801Z VENDOR_NAME="AWS" 2025-05-07T19:42:58.2659911Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:42:58.2659993Z SUPPORT_END="2029-06-30" 2025-05-07T19:42:58.2659997Z 2025-05-07T19:42:58.2698560Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:42:58.2698720Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:42:58.2699009Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:58.2699081Z env: 2025-05-07T19:42:58.2699189Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:58.2699289Z BUILD_ENV: build_binary 2025-05-07T19:42:58.2699372Z BUILD_TARGET: default 2025-05-07T19:42:58.2699449Z BUILD_VARIANT: cuda 2025-05-07T19:42:58.2699542Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:58.2699619Z ##[endgroup] 2025-05-07T19:42:58.7073172Z ################################################################################ 2025-05-07T19:42:58.7074989Z [INFO] Printing general display info ... 2025-05-07T19:42:58.7093742Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:42:58.7971658Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:42:58.7977025Z /usr/bin/sudo 2025-05-07T19:42:58.7987962Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:42:58.7995937Z /usr/bin/yum 2025-05-07T19:42:58.7996711Z [INSTALL] Updating system repositories ... 2025-05-07T19:42:58.8023507Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:42:59.0237018Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:59.1202596Z Dependencies resolved. 2025-05-07T19:42:59.1417251Z Nothing to do. 2025-05-07T19:42:59.1417992Z Complete! 2025-05-07T19:42:59.2222780Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:42:59.2246788Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:42:59.4424648Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:59.4938496Z Dependencies resolved. 
2025-05-07T19:42:59.5103654Z ================================================================================ 2025-05-07T19:42:59.5104444Z Package Arch Version Repository Size 2025-05-07T19:42:59.5104883Z ================================================================================ 2025-05-07T19:42:59.5105242Z Installing: 2025-05-07T19:42:59.5105575Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:59.5106090Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:42:59.5106395Z 2025-05-07T19:42:59.5106526Z Transaction Summary 2025-05-07T19:42:59.5106798Z ================================================================================ 2025-05-07T19:42:59.5107157Z Install 2 Packages 2025-05-07T19:42:59.5107313Z 2025-05-07T19:42:59.5107421Z Total download size: 347 k 2025-05-07T19:42:59.5107719Z Installed size: 883 k 2025-05-07T19:42:59.5107977Z Downloading Packages: 2025-05-07T19:42:59.8290202Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.6 MB/s | 28 kB 00:00 2025-05-07T19:42:59.8391882Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:42:59.8397806Z -------------------------------------------------------------------------------- 2025-05-07T19:42:59.8401605Z Total 1.0 MB/s | 347 kB 00:00 2025-05-07T19:42:59.8647419Z Running transaction check 2025-05-07T19:42:59.8702897Z Transaction check succeeded. 2025-05-07T19:42:59.8703300Z Running transaction test 2025-05-07T19:42:59.8868539Z Transaction test succeeded. 
2025-05-07T19:42:59.8868989Z Running transaction 2025-05-07T19:42:59.9159375Z Preparing : 1/1 2025-05-07T19:42:59.9256896Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:42:59.9293008Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:00.9755263Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:00.9757563Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:01.0249210Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:01.0250238Z 2025-05-07T19:43:01.0250495Z Installed: 2025-05-07T19:43:01.0251505Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:01.0252530Z 2025-05-07T19:43:01.0252765Z Complete! 2025-05-07T19:43:01.0995947Z + hostname 2025-05-07T19:43:01.0996113Z 2025-05-07T19:43:01.1007798Z 5cbac523ab1b 2025-05-07T19:43:01.1008284Z 2025-05-07T19:43:01.1008601Z + sudo lshw -C display 2025-05-07T19:43:01.1009121Z 2025-05-07T19:43:01.3038645Z *-display UNCLAIMED 2025-05-07T19:43:01.3039586Z description: VGA compatible controller 2025-05-07T19:43:01.3040622Z product: Amazon.com, Inc. 2025-05-07T19:43:01.3041478Z vendor: Amazon.com, Inc. 2025-05-07T19:43:01.3042253Z physical id: 3 2025-05-07T19:43:01.3042984Z bus info: pci@0000:00:03.0 2025-05-07T19:43:01.3043754Z version: 00 2025-05-07T19:43:01.3044388Z width: 32 bits 2025-05-07T19:43:01.3044626Z clock: 33MHz 2025-05-07T19:43:01.3044904Z capabilities: vga_controller bus_master 2025-05-07T19:43:01.3045255Z configuration: latency=0 2025-05-07T19:43:01.3045585Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:01.3064315Z 2025-05-07T19:43:01.3064898Z ################################################################################ 2025-05-07T19:43:01.3065322Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:01.3191629Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:01.3220668Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.3222117Z [CHECK] nvidia-smi not found 2025-05-07T19:43:01.3223027Z ################################################################################ 2025-05-07T19:43:01.3224007Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:01.3332050Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:01.3356465Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.3357501Z [CHECK] rocminfo not found 2025-05-07T19:43:01.3361658Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.3362435Z [CHECK] rocm-smi not found 2025-05-07T19:43:01.3441448Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:01.3441920Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:01.3442441Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:01.3442766Z env: 2025-05-07T19:43:01.3443000Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.3443299Z BUILD_ENV: build_binary 2025-05-07T19:43:01.3443566Z BUILD_TARGET: default 2025-05-07T19:43:01.3443794Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.3444041Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.3444291Z ##[endgroup] 2025-05-07T19:43:01.8032216Z ################################################################################ 2025-05-07T19:43:01.8033217Z # Setup Miniconda 2025-05-07T19:43:01.8034084Z # 2025-05-07T19:43:01.8046790Z # [2025-05-07T19:43:01.804Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:01.8047304Z ################################################################################ 2025-05-07T19:43:01.8047687Z 2025-05-07T19:43:01.8066369Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:01.8921102Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:01.8921538Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:01.8921784Z 2025-05-07T19:43:01.8941649Z 2025-05-07T19:43:01.8942361Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:01.8971161Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:03.3124218Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:03.3124756Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:03.3125073Z 2025-05-07T19:43:03.3266805Z PREFIX=/github/home/miniconda 2025-05-07T19:43:03.6844696Z Unpacking payload ... 2025-05-07T19:43:04.1636438Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:04.8278159Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:06.7028628Z 2025-05-07T19:43:06.7029278Z Installing base environment... 2025-05-07T19:43:06.7029586Z 2025-05-07T19:43:07.6978010Z Preparing transaction: ...working... done 2025-05-07T19:43:10.6105340Z Executing transaction: ...working... done 2025-05-07T19:43:11.1625695Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:11.2319644Z installation finished. 2025-05-07T19:43:11.2328926Z 2025-05-07T19:43:11.2329405Z + rm -f miniconda.sh 2025-05-07T19:43:11.2329624Z 2025-05-07T19:43:11.2523661Z 2025-05-07T19:43:11.2524340Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:11.2525954Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:11.6215593Z 2025-05-07T19:43:11.6216237Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:11.6217429Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:11.6218471Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:11.6219560Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:11.6220610Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:11.6221801Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:11.6223095Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:11.6224442Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:11.6225822Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:11.6226850Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:11.6227744Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:11.6228154Z modified /github/home/.bashrc 2025-05-07T19:43:11.6228700Z 2025-05-07T19:43:11.6229087Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:11.6229415Z 2025-05-07T19:43:11.6755489Z 2025-05-07T19:43:11.6756235Z + . /github/home/.bashrc 2025-05-07T19:43:11.6756472Z 2025-05-07T19:43:12.4699455Z 2025-05-07T19:43:12.4700316Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:12.4726663Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:24.3065006Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:25.7713239Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:25.8597643Z 2025-05-07T19:43:25.8598060Z ## Package Plan ## 2025-05-07T19:43:25.8598309Z 2025-05-07T19:43:25.8598475Z environment location: /github/home/miniconda 2025-05-07T19:43:25.8598744Z 2025-05-07T19:43:25.8598863Z added / updated specs: 2025-05-07T19:43:25.8599204Z - conda-libmamba-solver 2025-05-07T19:43:25.8599499Z - libarchive 2025-05-07T19:43:25.8599804Z - libmamba 2025-05-07T19:43:25.8600068Z - libmambapy 2025-05-07T19:43:25.8600211Z 2025-05-07T19:43:25.8600216Z 2025-05-07T19:43:25.8600355Z The following packages will be downloaded: 2025-05-07T19:43:25.8600624Z 2025-05-07T19:43:25.8600759Z package | build 2025-05-07T19:43:25.8601135Z ---------------------------|----------------- 2025-05-07T19:43:25.8601682Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:25.8602242Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:25.8602720Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:25.8603284Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:25.8603780Z ------------------------------------------------------------ 2025-05-07T19:43:25.8604192Z Total: 1.4 MB 2025-05-07T19:43:25.8604425Z 2025-05-07T19:43:25.8604554Z The following packages will be UPDATED: 2025-05-07T19:43:25.8604809Z 2025-05-07T19:43:25.8610609Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:25.8611519Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:25.8612221Z 2025-05-07T19:43:25.8612465Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:25.8612854Z 2025-05-07T19:43:25.8613208Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:25.8614119Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:25.8614658Z 2025-05-07T19:43:25.8614662Z 2025-05-07T19:43:25.8614666Z 2025-05-07T19:43:25.8614828Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:25.8615284Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:25.8615536Z 2025-05-07T19:43:25.8615949Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:25.8616215Z 2025-05-07T19:43:25.8616219Z 2025-05-07T19:43:25.8620940Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:25.8621261Z 2025-05-07T19:43:25.8621274Z 2025-05-07T19:43:25.8621466Z 2025-05-07T19:43:25.9141016Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:25.9141357Z 2025-05-07T19:43:25.9279695Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:25.9279999Z 2025-05-07T19:43:25.9296823Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:25.9297158Z 2025-05-07T19:43:25.9297350Z 2025-05-07T19:43:25.9297359Z 2025-05-07T19:43:25.9392431Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:25.9430786Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:25.9431096Z 2025-05-07T19:43:25.9431103Z 2025-05-07T19:43:25.9431108Z 2025-05-07T19:43:25.9756760Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:25.9757087Z 2025-05-07T19:43:25.9757096Z 2025-05-07T19:43:25.9796258Z ca-certificates-2025 | 149 KB | # | 11%  2025-05-07T19:43:25.9796616Z 2025-05-07T19:43:25.9796657Z 2025-05-07T19:43:25.9962085Z 
ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:25.9962405Z 2025-05-07T19:43:25.9962419Z 2025-05-07T19:43:26.0474234Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:26.0474704Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:26.0477031Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:26.0477425Z 2025-05-07T19:43:26.0477647Z 2025-05-07T19:43:26.0480471Z  2025-05-07T19:43:26.0480747Z 2025-05-07T19:43:26.0480751Z 2025-05-07T19:43:26.0480946Z  2025-05-07T19:43:26.0481178Z 2025-05-07T19:43:26.0481182Z 2025-05-07T19:43:26.0481185Z 2025-05-07T19:43:26.0481411Z  done 2025-05-07T19:43:26.1489474Z Preparing transaction: | done 2025-05-07T19:43:26.2502684Z Verifying transaction: - done 2025-05-07T19:43:27.5530862Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:29.1416600Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:29.1437663Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:29.8574383Z Channels: 2025-05-07T19:43:29.8574748Z - defaults 2025-05-07T19:43:29.8575006Z Platform: linux-64 2025-05-07T19:43:30.9407945Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:31.0710524Z Solving environment: / - Channels: 2025-05-07T19:43:31.0711070Z - defaults 2025-05-07T19:43:31.0711322Z Platform: linux-64 2025-05-07T19:43:31.3545595Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:31.5637038Z Solving environment: / - \ done 2025-05-07T19:43:31.6839307Z | done 2025-05-07T19:43:31.7472979Z 2025-05-07T19:43:31.7473336Z ## Package Plan ## 2025-05-07T19:43:31.7473880Z 2025-05-07T19:43:31.7474169Z environment location: /github/home/miniconda 2025-05-07T19:43:31.7474463Z 2025-05-07T19:43:31.7474609Z added / updated specs: 2025-05-07T19:43:31.7474897Z - conda 2025-05-07T19:43:31.7475071Z 2025-05-07T19:43:31.7475077Z 2025-05-07T19:43:31.7475215Z The following packages will be downloaded: 
2025-05-07T19:43:31.7475456Z 2025-05-07T19:43:31.7475590Z package | build 2025-05-07T19:43:31.7475974Z ---------------------------|----------------- 2025-05-07T19:43:31.7476384Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:31.7476812Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:31.7477251Z ------------------------------------------------------------ 2025-05-07T19:43:31.7477627Z Total: 1.4 MB 2025-05-07T19:43:31.7477886Z 2025-05-07T19:43:31.7478016Z The following packages will be UPDATED: 2025-05-07T19:43:31.7478255Z 2025-05-07T19:43:31.7478773Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:31.7479364Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:31.7479640Z 2025-05-07T19:43:31.7479644Z 2025-05-07T19:43:31.7479681Z 2025-05-07T19:43:31.7479843Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:31.7480240Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:31.7480510Z 2025-05-07T19:43:31.7925569Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:31.8119456Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.8119739Z 2025-05-07T19:43:31.9651892Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.9652380Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.9895173Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.9895592Z 2025-05-07T19:43:31.9895998Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.9896304Z 2025-05-07T19:43:31.9897095Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.9897507Z 2025-05-07T19:43:31.9897732Z 2025-05-07T19:43:31.9898011Z  done 2025-05-07T19:43:32.0909831Z Preparing transaction: - done 2025-05-07T19:43:32.1920898Z Verifying transaction: | done 2025-05-07T19:43:34.1970017Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:34.7478078Z [SETUP] Cleaning up Conda packages ... 
2025-05-07T19:43:34.7486631Z + conda clean --packages --tarball -y 2025-05-07T19:43:34.7487254Z 2025-05-07T19:43:35.1884327Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:35.1884752Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:35.2431398Z 2025-05-07T19:43:35.2440516Z + conda clean --all -y 2025-05-07T19:43:35.2441127Z 2025-05-07T19:43:35.7040574Z There are no unused tarball(s) to remove. 2025-05-07T19:43:35.7041633Z Will remove 1 index cache(s). 2025-05-07T19:43:35.7042319Z There are no unused package(s) to remove. 2025-05-07T19:43:35.7042658Z There are no tempfile(s) to remove. 2025-05-07T19:43:35.7042998Z There are no logfile(s) to remove. 2025-05-07T19:43:35.7594149Z 2025-05-07T19:43:35.7598701Z + conda info 2025-05-07T19:43:35.7599145Z 2025-05-07T19:43:36.3203936Z 2025-05-07T19:43:36.3204573Z active environment : base 2025-05-07T19:43:36.3205568Z active env location : /github/home/miniconda 2025-05-07T19:43:36.3206601Z shell level : 1 2025-05-07T19:43:36.3207397Z user config file : /github/home/.condarc 2025-05-07T19:43:36.3208555Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:36.3209630Z conda version : 25.3.1 2025-05-07T19:43:36.3210491Z conda-build version : not installed 2025-05-07T19:43:36.3211905Z python version : 3.13.2.final.0 2025-05-07T19:43:36.3212669Z solver : libmamba (default) 2025-05-07T19:43:36.3213024Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:36.3213346Z __conda=25.3.1=0 2025-05-07T19:43:36.3213839Z __glibc=2.34=0 2025-05-07T19:43:36.3214139Z __linux=6.1.130=0 2025-05-07T19:43:36.3214463Z __unix=0=0 2025-05-07T19:43:36.3214816Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:36.3215261Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:36.3215623Z conda av metadata url : None 2025-05-07T19:43:36.3216043Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:36.3216521Z https://repo.anaconda.com/pkgs/main/noarch 
2025-05-07T19:43:36.3216922Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:36.3217515Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:36.3217902Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:36.3218279Z /github/home/.conda/pkgs 2025-05-07T19:43:36.3218633Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:36.3219011Z /github/home/.conda/envs 2025-05-07T19:43:36.3219334Z platform : linux-64 2025-05-07T19:43:36.3220366Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:36.3221244Z UID:GID : 0:0 2025-05-07T19:43:36.3221508Z netrc file : None 2025-05-07T19:43:36.3221798Z offline mode : False 2025-05-07T19:43:36.3221973Z 2025-05-07T19:43:36.3793035Z 2025-05-07T19:43:36.3793382Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:36.3794219Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_b35abb5c-6b6c-4f17-8c01-9e39237bae16 ... 2025-05-07T19:43:36.3794960Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:36.3940493Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:36.3941080Z . 
$PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:36.3941943Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:36.3942323Z env: 2025-05-07T19:43:36.3942571Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:36.3942918Z BUILD_ENV: build_binary 2025-05-07T19:43:36.3943181Z BUILD_TARGET: default 2025-05-07T19:43:36.3943461Z BUILD_VARIANT: cuda 2025-05-07T19:43:36.3943718Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:36.3944011Z ##[endgroup] 2025-05-07T19:43:36.8117939Z ################################################################################ 2025-05-07T19:43:36.8119035Z # Create Conda Environment 2025-05-07T19:43:36.8119792Z # 2025-05-07T19:43:36.8131513Z # [2025-05-07T19:43:36.812Z] + create_conda_environment build_binary 3.11 2025-05-07T19:43:36.8132876Z ################################################################################ 2025-05-07T19:43:36.8133538Z 2025-05-07T19:43:36.8150083Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:36.8991336Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:36.8992216Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:36.8992647Z + conda info --envs 2025-05-07T19:43:36.8992812Z 2025-05-07T19:43:37.4860153Z 2025-05-07T19:43:37.4860958Z # conda environments: 2025-05-07T19:43:37.4861772Z # 2025-05-07T19:43:37.4862026Z base /github/home/miniconda 2025-05-07T19:43:37.4862293Z 2025-05-07T19:43:37.5592236Z 2025-05-07T19:43:37.5592643Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:39.1641316Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:39.1642697Z 2025-05-07T19:43:39.1666815Z 2025-05-07T19:43:39.1674654Z [SETUP] Creating new Conda environment (Python 3.11) ... 
2025-05-07T19:43:39.1703318Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.11 2025-05-07T19:43:39.7426707Z Channels: 2025-05-07T19:43:39.7427383Z - defaults 2025-05-07T19:43:39.7428035Z Platform: linux-64 2025-05-07T19:43:41.1350925Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:41.2357707Z Solving environment: | done 2025-05-07T19:43:41.2657532Z 2025-05-07T19:43:41.2657988Z ## Package Plan ## 2025-05-07T19:43:41.2658468Z 2025-05-07T19:43:41.2659170Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:41.2660241Z 2025-05-07T19:43:41.2660573Z added / updated specs: 2025-05-07T19:43:41.2661481Z - python=3.11 2025-05-07T19:43:41.2661887Z 2025-05-07T19:43:41.2661899Z 2025-05-07T19:43:41.2662259Z The following packages will be downloaded: 2025-05-07T19:43:41.2662993Z 2025-05-07T19:43:41.2663331Z package | build 2025-05-07T19:43:41.2664303Z ---------------------------|----------------- 2025-05-07T19:43:41.2665453Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:41.2666557Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:41.2667033Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:41.2667511Z python-3.11.11 | he870216_0 32.9 MB 2025-05-07T19:43:41.2667951Z setuptools-78.1.1 | py311h06a4308_0 2.3 MB 2025-05-07T19:43:41.2668409Z wheel-0.45.1 | py311h06a4308_0 151 KB 2025-05-07T19:43:41.2668813Z ------------------------------------------------------------ 2025-05-07T19:43:41.2669208Z Total: 35.4 MB 2025-05-07T19:43:41.2669437Z 2025-05-07T19:43:41.2669611Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:41.2669861Z 2025-05-07T19:43:41.2670095Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:41.2670602Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:41.2671470Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:41.2672082Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 
2025-05-07T19:43:41.2672710Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:41.2673275Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:41.2673771Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:41.2674379Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:41.2674948Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:41.2675479Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:41.2675947Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:41.2676432Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:41.2676876Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:41.2677337Z python pkgs/main/linux-64::python-3.11.11-he870216_0 2025-05-07T19:43:41.2677825Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:41.2678339Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py311h06a4308_0 2025-05-07T19:43:41.2678873Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:41.2679293Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:41.2679730Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:41.2680208Z wheel pkgs/main/linux-64::wheel-0.45.1-py311h06a4308_0 2025-05-07T19:43:41.2680787Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:41.2681392Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:41.2681641Z 2025-05-07T19:43:41.2681645Z 2025-05-07T19:43:41.2681649Z 2025-05-07T19:43:41.2681801Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:41.2682263Z python-3.11.11 | 32.9 MB | | 0% 2025-05-07T19:43:41.2682495Z 2025-05-07T19:43:41.2682877Z setuptools-78.1.1 | 2.3 MB | | 0%  2025-05-07T19:43:41.2683320Z 2025-05-07T19:43:41.2683324Z 2025-05-07T19:43:41.2683564Z wheel-0.45.1 | 151 KB | | 0%  2025-05-07T19:43:41.2683904Z 2025-05-07T19:43:41.2683909Z 2025-05-07T19:43:41.2683912Z 2025-05-07T19:43:41.2684571Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:41.2684879Z 2025-05-07T19:43:41.2684883Z 2025-05-07T19:43:41.2684887Z 2025-05-07T19:43:41.2687164Z 2025-05-07T19:43:41.2696249Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:41.2697105Z 2025-05-07T19:43:41.2697116Z 2025-05-07T19:43:41.2697126Z 2025-05-07T19:43:41.2697136Z 2025-05-07T19:43:41.2708212Z 2025-05-07T19:43:41.3067150Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:41.3067477Z 2025-05-07T19:43:41.3067483Z 2025-05-07T19:43:41.3067487Z 2025-05-07T19:43:41.3067490Z 2025-05-07T19:43:41.3150433Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:41.3150759Z 2025-05-07T19:43:41.3150776Z 2025-05-07T19:43:41.3236761Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:41.3237611Z 2025-05-07T19:43:41.3237626Z 2025-05-07T19:43:41.3237658Z 2025-05-07T19:43:41.3345719Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:41.3346254Z 2025-05-07T19:43:41.3346259Z 2025-05-07T19:43:41.3346294Z 2025-05-07T19:43:41.3346298Z 2025-05-07T19:43:41.3346302Z 2025-05-07T19:43:41.3453472Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:41.3454003Z 2025-05-07T19:43:41.3454008Z 2025-05-07T19:43:41.3454014Z 2025-05-07T19:43:41.3550845Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:41.3551644Z 2025-05-07T19:43:41.3551667Z 2025-05-07T19:43:41.3551671Z 2025-05-07T19:43:41.3551675Z 2025-05-07T19:43:41.3551678Z 2025-05-07T19:43:41.3596623Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:41.3596972Z 2025-05-07T19:43:41.3596977Z 2025-05-07T19:43:41.3596982Z 
2025-05-07T19:43:41.3596987Z 2025-05-07T19:43:41.3660217Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:41.3661504Z python-3.11.11 | 32.9 MB | #1 | 11% 2025-05-07T19:43:41.3661766Z 2025-05-07T19:43:41.3751543Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:41.3751867Z 2025-05-07T19:43:41.3752042Z 2025-05-07T19:43:41.3752807Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:41.3753135Z 2025-05-07T19:43:41.3753141Z 2025-05-07T19:43:41.4673165Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:41.5792679Z python-3.11.11 | 32.9 MB | #####9 | 59% 2025-05-07T19:43:41.5880173Z python-3.11.11 | 32.9 MB | #########4 | 94% 2025-05-07T19:43:41.5880521Z 2025-05-07T19:43:41.5881002Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:41.5881303Z 2025-05-07T19:43:41.6884354Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:42.2475397Z python-3.11.11 | 32.9 MB | ########## | 100% 2025-05-07T19:43:42.2476744Z python-3.11.11 | 32.9 MB | ########## | 100% 2025-05-07T19:43:42.2477164Z 2025-05-07T19:43:42.2477384Z 2025-05-07T19:43:42.2477682Z  2025-05-07T19:43:42.2477906Z 2025-05-07T19:43:42.2477912Z 2025-05-07T19:43:42.2478367Z  2025-05-07T19:43:42.2478624Z 2025-05-07T19:43:42.2478628Z 2025-05-07T19:43:42.2478631Z 2025-05-07T19:43:42.2478820Z  2025-05-07T19:43:42.2479058Z 2025-05-07T19:43:42.2479062Z 2025-05-07T19:43:42.2479066Z 2025-05-07T19:43:42.2479071Z 2025-05-07T19:43:42.2479293Z  2025-05-07T19:43:42.2479521Z 2025-05-07T19:43:42.2479525Z 2025-05-07T19:43:42.2479529Z 2025-05-07T19:43:42.2479532Z 2025-05-07T19:43:42.2479536Z 2025-05-07T19:43:42.2479740Z  done 2025-05-07T19:43:42.4592052Z Preparing transaction: - \ done 2025-05-07T19:43:43.8432123Z Verifying transaction: / - \ | / - \ | / - \ | / done 2025-05-07T19:43:45.9581137Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:45.9620099Z # 2025-05-07T19:43:45.9620581Z # To activate this environment, use 
2025-05-07T19:43:45.9621063Z # 2025-05-07T19:43:45.9621291Z # $ conda activate build_binary 2025-05-07T19:43:45.9621642Z # 2025-05-07T19:43:45.9621908Z # To deactivate an active environment, use 2025-05-07T19:43:45.9622259Z # 2025-05-07T19:43:45.9622493Z # $ conda deactivate 2025-05-07T19:43:45.9622666Z 2025-05-07T19:43:46.0463585Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:46.0488910Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:48.9328982Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:48.9330523Z 2025-05-07T19:43:48.9330987Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (25.1) 2025-05-07T19:43:48.9331649Z Collecting pip 2025-05-07T19:43:48.9332025Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:48.9332806Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:48.9333573Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 69.4 MB/s eta 0:00:00 2025-05-07T19:43:48.9333997Z Installing collected packages: pip 2025-05-07T19:43:48.9334444Z Attempting uninstall: pip 2025-05-07T19:43:48.9334777Z Found existing installation: pip 25.1 2025-05-07T19:43:48.9335107Z Uninstalling pip-25.1: 2025-05-07T19:43:48.9335536Z Successfully uninstalled pip-25.1 2025-05-07T19:43:48.9335845Z Successfully installed pip-25.1.1 2025-05-07T19:43:48.9336061Z 2025-05-07T19:43:48.9920102Z [SETUP] Upgrading pyOpenSSL ... 
2025-05-07T19:43:48.9946125Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:49.6529482Z Channels: 2025-05-07T19:43:49.6529888Z - conda-forge 2025-05-07T19:43:49.6530147Z Platform: linux-64 2025-05-07T19:43:59.3979923Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:01.3177803Z Solving environment: | / - \ | done 2025-05-07T19:44:01.3654567Z 2025-05-07T19:44:01.3654874Z ## Package Plan ## 2025-05-07T19:44:01.3655150Z 2025-05-07T19:44:01.3656043Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:01.3656433Z 2025-05-07T19:44:01.3656551Z added / updated specs: 2025-05-07T19:44:01.3656873Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:01.3657085Z 2025-05-07T19:44:01.3657134Z 2025-05-07T19:44:01.3657269Z The following packages will be downloaded: 2025-05-07T19:44:01.3657531Z 2025-05-07T19:44:01.3657660Z package | build 2025-05-07T19:44:01.3658307Z ---------------------------|----------------- 2025-05-07T19:44:01.3659035Z cffi-1.17.1 | py311hf29c0ef_0 295 KB conda-forge 2025-05-07T19:44:01.3659555Z cryptography-44.0.3 | py311hafd3f86_0 1.5 MB conda-forge 2025-05-07T19:44:01.3660047Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:01.3660508Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:01.3660957Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:01.3661417Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:01.3661888Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:01.3662354Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:01.3662828Z python_abi-3.11 | 2_cp311 5 KB conda-forge 2025-05-07T19:44:01.3663311Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:01.3663862Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:01.3664321Z 
------------------------------------------------------------ 2025-05-07T19:44:01.3664704Z Total: 6.4 MB 2025-05-07T19:44:01.3664928Z 2025-05-07T19:44:01.3665092Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:01.3665330Z 2025-05-07T19:44:01.3665552Z cffi conda-forge/linux-64::cffi-1.17.1-py311hf29c0ef_0 2025-05-07T19:44:01.3666109Z cryptography conda-forge/linux-64::cryptography-44.0.3-py311hafd3f86_0 2025-05-07T19:44:01.3666640Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:01.3667141Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:01.3667673Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:01.3668186Z python_abi conda-forge/linux-64::python_abi-3.11-2_cp311 2025-05-07T19:44:01.3671619Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:01.3672450Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:01.3672850Z 2025-05-07T19:44:01.3672977Z The following packages will be UPDATED: 2025-05-07T19:44:01.3673196Z 2025-05-07T19:44:01.3673644Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:01.3674579Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:01.3675371Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:01.3676076Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:01.3676468Z 2025-05-07T19:44:01.3676486Z 2025-05-07T19:44:01.3676490Z 2025-05-07T19:44:01.3676640Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:01.3677061Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:01.3677308Z 2025-05-07T19:44:01.3677655Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:01.3677953Z 2025-05-07T19:44:01.3677957Z 2025-05-07T19:44:01.3678175Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:01.3678430Z 2025-05-07T19:44:01.3678459Z 2025-05-07T19:44:01.3678462Z 2025-05-07T19:44:01.3683695Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:01.3683962Z 2025-05-07T19:44:01.3683966Z 2025-05-07T19:44:01.3683969Z 2025-05-07T19:44:01.3688134Z 2025-05-07T19:44:01.3705971Z cffi-1.17.1 | 295 KB | | 0%  2025-05-07T19:44:01.3706322Z 2025-05-07T19:44:01.3706328Z 2025-05-07T19:44:01.3706334Z 2025-05-07T19:44:01.3706337Z 2025-05-07T19:44:01.3706542Z 2025-05-07T19:44:01.3706827Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:01.3707125Z 2025-05-07T19:44:01.3707156Z 2025-05-07T19:44:01.3707160Z 2025-05-07T19:44:01.3707163Z 2025-05-07T19:44:01.3707167Z 2025-05-07T19:44:01.3707184Z 2025-05-07T19:44:01.3707447Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:01.3707736Z 2025-05-07T19:44:01.3707740Z 2025-05-07T19:44:01.3707743Z 2025-05-07T19:44:01.3707746Z 2025-05-07T19:44:01.3707750Z 2025-05-07T19:44:01.3707777Z 2025-05-07T19:44:01.3707781Z 2025-05-07T19:44:01.3708065Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:01.3708388Z 2025-05-07T19:44:01.3708391Z 2025-05-07T19:44:01.3708395Z 2025-05-07T19:44:01.3708398Z 2025-05-07T19:44:01.3708402Z 2025-05-07T19:44:01.3708405Z 2025-05-07T19:44:01.3708409Z 2025-05-07T19:44:01.3708412Z 2025-05-07T19:44:01.3708724Z typing_extensions-4. 
| 51 KB | | 0%  2025-05-07T19:44:01.3709039Z 2025-05-07T19:44:01.3709042Z 2025-05-07T19:44:01.3709046Z 2025-05-07T19:44:01.3709049Z 2025-05-07T19:44:01.3709053Z 2025-05-07T19:44:01.3709056Z 2025-05-07T19:44:01.3709060Z 2025-05-07T19:44:01.3709063Z 2025-05-07T19:44:01.3709097Z 2025-05-07T19:44:01.3709353Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:01.3709644Z 2025-05-07T19:44:01.3709648Z 2025-05-07T19:44:01.3709651Z 2025-05-07T19:44:01.3709655Z 2025-05-07T19:44:01.3709659Z 2025-05-07T19:44:01.3709662Z 2025-05-07T19:44:01.3709666Z 2025-05-07T19:44:01.3709669Z 2025-05-07T19:44:01.3709673Z 2025-05-07T19:44:01.3709700Z 2025-05-07T19:44:01.4343370Z python_abi-3.11 | 5 KB | | 0%  2025-05-07T19:44:01.4344288Z 2025-05-07T19:44:01.4344302Z 2025-05-07T19:44:01.4344313Z 2025-05-07T19:44:01.4344323Z 2025-05-07T19:44:01.4454322Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:01.4454830Z 2025-05-07T19:44:01.4454835Z 2025-05-07T19:44:01.4454839Z 2025-05-07T19:44:01.4658586Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:01.4666913Z openssl-3.5.0 | 3.0 MB | 1 | 1% 2025-05-07T19:44:01.4667672Z 2025-05-07T19:44:01.4667688Z 2025-05-07T19:44:01.4716951Z libgcc-15.1.0 | 810 KB | ###3 | 34%  2025-05-07T19:44:01.4717257Z 2025-05-07T19:44:01.4717262Z 2025-05-07T19:44:01.4717266Z 2025-05-07T19:44:01.4717270Z 2025-05-07T19:44:01.4718676Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:01.4718945Z 2025-05-07T19:44:01.4718958Z 2025-05-07T19:44:01.4718962Z 2025-05-07T19:44:01.4718966Z 2025-05-07T19:44:01.4725227Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:01.4726004Z 2025-05-07T19:44:01.4726149Z 2025-05-07T19:44:01.4726162Z 2025-05-07T19:44:01.4726209Z 2025-05-07T19:44:01.4726257Z 2025-05-07T19:44:01.4748958Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:01.4750079Z 2025-05-07T19:44:01.4750296Z 2025-05-07T19:44:01.4750308Z 2025-05-07T19:44:01.4750318Z 2025-05-07T19:44:01.4750329Z 2025-05-07T19:44:01.4787437Z pyopenssl-25.0.0 
| 120 KB | ########## | 100%  2025-05-07T19:44:01.4787815Z 2025-05-07T19:44:01.4788007Z 2025-05-07T19:44:01.4882847Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:01.4883154Z 2025-05-07T19:44:01.4913936Z cryptography-44.0.3 | 1.5 MB | ###8 | 39%  2025-05-07T19:44:01.4914398Z 2025-05-07T19:44:01.4914403Z 2025-05-07T19:44:01.4914407Z 2025-05-07T19:44:01.4915679Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:01.4915996Z 2025-05-07T19:44:01.4916000Z 2025-05-07T19:44:01.4916004Z 2025-05-07T19:44:01.4986413Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:01.4986732Z 2025-05-07T19:44:01.4986738Z 2025-05-07T19:44:01.4987002Z 2025-05-07T19:44:01.4987006Z 2025-05-07T19:44:01.4987042Z 2025-05-07T19:44:01.4987352Z 2025-05-07T19:44:01.5034572Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:01.5035506Z 2025-05-07T19:44:01.5035520Z 2025-05-07T19:44:01.5035569Z 2025-05-07T19:44:01.5035609Z 2025-05-07T19:44:01.5035619Z 2025-05-07T19:44:01.5035630Z 2025-05-07T19:44:01.5131206Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:01.5131579Z 2025-05-07T19:44:01.5131583Z 2025-05-07T19:44:01.5131587Z 2025-05-07T19:44:01.5131591Z 2025-05-07T19:44:01.5131595Z 2025-05-07T19:44:01.5131627Z 2025-05-07T19:44:01.5131631Z 2025-05-07T19:44:01.5180278Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:01.5180646Z 2025-05-07T19:44:01.5180652Z 2025-05-07T19:44:01.5180656Z 2025-05-07T19:44:01.5180660Z 2025-05-07T19:44:01.5180665Z 2025-05-07T19:44:01.5180671Z 2025-05-07T19:44:01.5180699Z 2025-05-07T19:44:01.5217073Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:01.5217438Z 2025-05-07T19:44:01.5217443Z 2025-05-07T19:44:01.5217447Z 2025-05-07T19:44:01.5217451Z 2025-05-07T19:44:01.5217454Z 2025-05-07T19:44:01.5217476Z 2025-05-07T19:44:01.5217480Z 2025-05-07T19:44:01.5217508Z 2025-05-07T19:44:01.5221469Z typing_extensions-4. 
| 51 KB | ###1 | 31%  2025-05-07T19:44:01.5221790Z 2025-05-07T19:44:01.5221801Z 2025-05-07T19:44:01.5221804Z 2025-05-07T19:44:01.5221808Z 2025-05-07T19:44:01.5221812Z 2025-05-07T19:44:01.5245337Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:01.5245670Z 2025-05-07T19:44:01.5245675Z 2025-05-07T19:44:01.5245679Z 2025-05-07T19:44:01.5245694Z 2025-05-07T19:44:01.5245698Z 2025-05-07T19:44:01.5245702Z 2025-05-07T19:44:01.5245707Z 2025-05-07T19:44:01.5245711Z 2025-05-07T19:44:01.5418445Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:01.5418814Z 2025-05-07T19:44:01.5418818Z 2025-05-07T19:44:01.5418822Z 2025-05-07T19:44:01.5418826Z 2025-05-07T19:44:01.5418829Z 2025-05-07T19:44:01.5418833Z 2025-05-07T19:44:01.5418836Z 2025-05-07T19:44:01.5474700Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:01.5475078Z 2025-05-07T19:44:01.5573292Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:01.5573600Z 2025-05-07T19:44:01.5573756Z 2025-05-07T19:44:01.5573767Z 2025-05-07T19:44:01.5573771Z 2025-05-07T19:44:01.5573776Z 2025-05-07T19:44:01.5573814Z 2025-05-07T19:44:01.5573824Z 2025-05-07T19:44:01.5573830Z 2025-05-07T19:44:01.5573833Z 2025-05-07T19:44:01.5573843Z 2025-05-07T19:44:01.5580071Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:01.5580393Z 2025-05-07T19:44:01.5580398Z 2025-05-07T19:44:01.5580401Z 2025-05-07T19:44:01.5580405Z 2025-05-07T19:44:01.5580409Z 2025-05-07T19:44:01.5580430Z 2025-05-07T19:44:01.5580434Z 2025-05-07T19:44:01.5580437Z 2025-05-07T19:44:01.5580441Z 2025-05-07T19:44:01.5580620Z 2025-05-07T19:44:01.5590250Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:01.5590597Z 2025-05-07T19:44:01.5590602Z 2025-05-07T19:44:01.5590606Z 2025-05-07T19:44:01.5590611Z 2025-05-07T19:44:01.5590615Z 2025-05-07T19:44:01.5590619Z 2025-05-07T19:44:01.5590624Z 2025-05-07T19:44:01.5590693Z 2025-05-07T19:44:01.5610723Z typing_extensions-4. 
| 51 KB | ########## | 100%  2025-05-07T19:44:01.5797867Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:01.5798186Z 2025-05-07T19:44:01.5798191Z 2025-05-07T19:44:01.5798196Z 2025-05-07T19:44:01.5798200Z 2025-05-07T19:44:01.5798204Z 2025-05-07T19:44:01.5798207Z 2025-05-07T19:44:01.5798211Z 2025-05-07T19:44:01.5798214Z 2025-05-07T19:44:01.5798218Z 2025-05-07T19:44:01.5817520Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:01.5818084Z 2025-05-07T19:44:01.5818089Z 2025-05-07T19:44:01.5818092Z 2025-05-07T19:44:01.5818096Z 2025-05-07T19:44:01.5818099Z 2025-05-07T19:44:01.5818103Z 2025-05-07T19:44:01.5818106Z 2025-05-07T19:44:01.5818110Z 2025-05-07T19:44:01.5818121Z 2025-05-07T19:44:01.5822213Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:01.5822505Z 2025-05-07T19:44:01.5822514Z 2025-05-07T19:44:01.5822739Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:01.5823017Z 2025-05-07T19:44:01.5823024Z 2025-05-07T19:44:01.5943688Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:01.5944007Z 2025-05-07T19:44:01.5944135Z 2025-05-07T19:44:01.5944143Z 2025-05-07T19:44:01.5944148Z 2025-05-07T19:44:01.5944191Z 2025-05-07T19:44:01.5944196Z 2025-05-07T19:44:01.5944200Z 2025-05-07T19:44:01.5944205Z 2025-05-07T19:44:01.5944210Z 2025-05-07T19:44:01.5944214Z 2025-05-07T19:44:01.6319933Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:01.6320256Z 2025-05-07T19:44:01.6320275Z 2025-05-07T19:44:01.6320279Z 2025-05-07T19:44:01.6320283Z 2025-05-07T19:44:01.6320286Z 2025-05-07T19:44:01.6320306Z 2025-05-07T19:44:01.6321426Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:01.6321718Z 2025-05-07T19:44:01.6321730Z 2025-05-07T19:44:01.6321734Z 2025-05-07T19:44:01.6321738Z 2025-05-07T19:44:01.6321741Z 2025-05-07T19:44:01.6321745Z 2025-05-07T19:44:01.6630462Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:01.6630811Z 2025-05-07T19:44:01.6630815Z 2025-05-07T19:44:01.6630820Z 
2025-05-07T19:44:01.6630825Z 2025-05-07T19:44:01.6630862Z 2025-05-07T19:44:01.6630866Z 2025-05-07T19:44:01.6630978Z 2025-05-07T19:44:01.6630988Z 2025-05-07T19:44:01.6630993Z 2025-05-07T19:44:01.7512145Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:01.7512675Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:01.7565094Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:01.7565590Z 2025-05-07T19:44:01.7566172Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:01.7566456Z 2025-05-07T19:44:01.7572183Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:01.7573338Z 2025-05-07T19:44:01.7573953Z 2025-05-07T19:44:01.7574432Z  2025-05-07T19:44:01.7575071Z 2025-05-07T19:44:01.7575084Z 2025-05-07T19:44:01.7575598Z  2025-05-07T19:44:01.7575819Z 2025-05-07T19:44:01.7575823Z 2025-05-07T19:44:01.7575826Z 2025-05-07T19:44:01.7576025Z  2025-05-07T19:44:01.7576248Z 2025-05-07T19:44:01.7576251Z 2025-05-07T19:44:01.7576263Z 2025-05-07T19:44:01.7576267Z 2025-05-07T19:44:01.7576452Z  2025-05-07T19:44:01.7576700Z 2025-05-07T19:44:01.7576703Z 2025-05-07T19:44:01.7576707Z 2025-05-07T19:44:01.7576711Z 2025-05-07T19:44:01.7576724Z 2025-05-07T19:44:01.7576913Z  2025-05-07T19:44:01.7577148Z 2025-05-07T19:44:01.7577151Z 2025-05-07T19:44:01.7577155Z 2025-05-07T19:44:01.7577184Z 2025-05-07T19:44:01.7577187Z 2025-05-07T19:44:01.7577190Z 2025-05-07T19:44:01.7577384Z  2025-05-07T19:44:01.7577618Z 2025-05-07T19:44:01.7577621Z 2025-05-07T19:44:01.7577625Z 2025-05-07T19:44:01.7577628Z 2025-05-07T19:44:01.7577632Z 2025-05-07T19:44:01.7577635Z 2025-05-07T19:44:01.7577639Z 2025-05-07T19:44:01.7577859Z  2025-05-07T19:44:01.7578094Z 2025-05-07T19:44:01.7578268Z 2025-05-07T19:44:01.7578272Z 2025-05-07T19:44:01.7578276Z 2025-05-07T19:44:01.7578279Z 2025-05-07T19:44:01.7578283Z 2025-05-07T19:44:01.7578287Z 2025-05-07T19:44:01.7578290Z 2025-05-07T19:44:01.7578521Z  2025-05-07T19:44:01.7578757Z 2025-05-07T19:44:01.7578760Z 
2025-05-07T19:44:01.7578764Z 2025-05-07T19:44:01.7578767Z 2025-05-07T19:44:01.7578771Z 2025-05-07T19:44:01.7578774Z 2025-05-07T19:44:01.7578777Z 2025-05-07T19:44:01.7578781Z 2025-05-07T19:44:01.7578784Z 2025-05-07T19:44:01.7579002Z  2025-05-07T19:44:01.7579237Z 2025-05-07T19:44:01.7579241Z 2025-05-07T19:44:01.7579245Z 2025-05-07T19:44:01.7579248Z 2025-05-07T19:44:01.7579252Z 2025-05-07T19:44:01.7579255Z 2025-05-07T19:44:01.7579258Z 2025-05-07T19:44:01.7579262Z 2025-05-07T19:44:01.7579265Z 2025-05-07T19:44:01.7579268Z 2025-05-07T19:44:01.7579506Z  done 2025-05-07T19:44:01.8584949Z Preparing transaction: - done 2025-05-07T19:44:01.9594258Z Verifying transaction: | done 2025-05-07T19:44:03.3621566Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:03.4619690Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:05.1633538Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:05.1642635Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:05.1669349Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:05.8301765Z Channels: 2025-05-07T19:44:05.8302470Z - conda-forge 2025-05-07T19:44:05.8303123Z Platform: linux-64 2025-05-07T19:44:08.9059198Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:09.3330901Z Solving environment: \ done 2025-05-07T19:44:09.3826822Z 2025-05-07T19:44:09.3827437Z ## Package Plan ## 2025-05-07T19:44:09.3827958Z 2025-05-07T19:44:09.3829081Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:09.3830101Z 2025-05-07T19:44:09.3830400Z added / updated specs: 2025-05-07T19:44:09.3831174Z - libxcrypt 2025-05-07T19:44:09.3831569Z 2025-05-07T19:44:09.3831995Z 2025-05-07T19:44:09.3832370Z The following packages will be downloaded: 2025-05-07T19:44:09.3833116Z 2025-05-07T19:44:09.3833252Z package | build 2025-05-07T19:44:09.3833611Z ---------------------------|----------------- 2025-05-07T19:44:09.3834162Z 
libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:09.3834613Z ------------------------------------------------------------ 2025-05-07T19:44:09.3835004Z Total: 98 KB 2025-05-07T19:44:09.3835293Z 2025-05-07T19:44:09.3835453Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:09.3835692Z 2025-05-07T19:44:09.3835941Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:09.3836272Z 2025-05-07T19:44:09.3836276Z 2025-05-07T19:44:09.3836280Z 2025-05-07T19:44:09.3836431Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:09.5235042Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:09.5268089Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:09.5365669Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:09.5366293Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:09.5366666Z 2025-05-07T19:44:09.5366963Z done 2025-05-07T19:44:09.6375257Z Preparing transaction: / done 2025-05-07T19:44:09.7382942Z Verifying transaction: \ done 2025-05-07T19:44:09.8393285Z Executing transaction: / done 2025-05-07T19:44:13.1483979Z [SETUP] Copying over ... 2025-05-07T19:44:13.1486141Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.11/crypt.h 2025-05-07T19:44:13.1487398Z 2025-05-07T19:44:13.1516749Z 2025-05-07T19:44:14.7552549Z [SETUP] Installed Python version: Python 3.11.11 2025-05-07T19:44:14.7553839Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:14.7629424Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:14.7629965Z . 
$PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:14.7630622Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:14.7630960Z env: 2025-05-07T19:44:14.7631183Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:14.7631502Z BUILD_ENV: build_binary 2025-05-07T19:44:14.7631747Z BUILD_TARGET: default 2025-05-07T19:44:14.7631991Z BUILD_VARIANT: cuda 2025-05-07T19:44:14.7632229Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:14.7632490Z ##[endgroup] 2025-05-07T19:44:15.1779636Z ################################################################################ 2025-05-07T19:44:15.1780126Z # Install C/C++ Compilers 2025-05-07T19:44:15.1780406Z # 2025-05-07T19:44:15.1793079Z # [2025-05-07T19:44:15.178Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:15.1793716Z ################################################################################ 2025-05-07T19:44:15.1794116Z 2025-05-07T19:44:15.1817501Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:15.2694337Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:15.2703656Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:15.2728950Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:15.9379268Z Channels: 2025-05-07T19:44:15.9380086Z - conda-forge 2025-05-07T19:44:15.9380375Z Platform: linux-64 2025-05-07T19:44:19.0070685Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:19.4339990Z Solving environment: \ done 2025-05-07T19:44:19.4832810Z 2025-05-07T19:44:19.4833422Z ## Package Plan ## 2025-05-07T19:44:19.4833918Z 2025-05-07T19:44:19.4834726Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:19.4835642Z 2025-05-07T19:44:19.4835949Z added / updated specs: 2025-05-07T19:44:19.4836901Z - sysroot_linux-64=2.17 2025-05-07T19:44:19.4837082Z 2025-05-07T19:44:19.4837087Z 2025-05-07T19:44:19.4837246Z The following packages will be downloaded: 2025-05-07T19:44:19.4837478Z 2025-05-07T19:44:19.4837603Z package | build 2025-05-07T19:44:19.4837980Z ---------------------------|----------------- 2025-05-07T19:44:19.4838431Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:19.4838982Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:19.4839468Z ------------------------------------------------------------ 2025-05-07T19:44:19.4839841Z Total: 15.4 MB 2025-05-07T19:44:19.4840131Z 2025-05-07T19:44:19.4840274Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:19.4840515Z 2025-05-07T19:44:19.4840956Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:19.4841709Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:19.4842023Z 2025-05-07T19:44:19.4842027Z 2025-05-07T19:44:19.4842030Z 2025-05-07T19:44:19.4842201Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:19.4842581Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:19.4842846Z 2025-05-07T19:44:19.7031404Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:19.7217221Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:19.7217748Z 2025-05-07T19:44:19.7342527Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:19.7342829Z 2025-05-07T19:44:19.8072920Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:19.9268188Z sysroot_linux-64-2.1 | 14.5 MB | ######7 | 68% 2025-05-07T19:44:19.9268516Z 2025-05-07T19:44:19.9268852Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:19.9269203Z 2025-05-07T19:44:19.9302696Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:20.3680498Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:20.3682152Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:20.3682555Z 2025-05-07T19:44:20.3682778Z 2025-05-07T19:44:20.3685827Z  done 2025-05-07T19:44:20.4694442Z Preparing transaction: / done 2025-05-07T19:44:20.6706023Z Verifying transaction: \ | done 2025-05-07T19:44:20.7716592Z Executing transaction: - done 2025-05-07T19:44:20.8591944Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:20.8593598Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:22.4869232Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:22.4881630Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 
2025-05-07T19:44:22.4906397Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:23.1778246Z Channels: 2025-05-07T19:44:23.1778680Z - conda-forge 2025-05-07T19:44:23.1778993Z Platform: linux-64 2025-05-07T19:44:26.2917509Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:27.4358688Z Solving environment: \ | / done 2025-05-07T19:44:27.4892507Z 2025-05-07T19:44:27.4893106Z ## Package Plan ## 2025-05-07T19:44:27.4893566Z 2025-05-07T19:44:27.4894160Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:27.4895112Z 2025-05-07T19:44:27.4895477Z added / updated specs: 2025-05-07T19:44:27.4896258Z - gxx_linux-64=11.4.0 2025-05-07T19:44:27.4896764Z 2025-05-07T19:44:27.4896776Z 2025-05-07T19:44:27.4897128Z The following packages will be downloaded: 2025-05-07T19:44:27.4897783Z 2025-05-07T19:44:27.4898151Z package | build 2025-05-07T19:44:27.4899102Z ---------------------------|----------------- 2025-05-07T19:44:27.4899545Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:27.4900035Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:27.4900527Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:27.4900995Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:27.4901437Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:27.4901905Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:27.4902340Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:27.4902845Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:27.4903349Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:27.4903806Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:27.4904310Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:27.4905166Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:27.4905639Z ------------------------------------------------------------ 2025-05-07T19:44:27.4906013Z Total: 91.6 MB 2025-05-07T19:44:27.4906317Z 2025-05-07T19:44:27.4906467Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:27.4906710Z 2025-05-07T19:44:27.4907050Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:27.4907956Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:27.4908575Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:27.4909288Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:27.4909873Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:27.4910450Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:27.4911025Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:27.4911769Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:27.4912387Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:27.4912961Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:27.4913327Z 2025-05-07T19:44:27.4913473Z The following packages will be UPDATED: 2025-05-07T19:44:27.4913679Z 2025-05-07T19:44:27.4914123Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:27.4915104Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:27.4915548Z 2025-05-07T19:44:27.4915552Z 2025-05-07T19:44:27.4915556Z 2025-05-07T19:44:27.4915734Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:27.4916134Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:27.4916413Z 2025-05-07T19:44:27.4916754Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:27.4917015Z 2025-05-07T19:44:27.4917019Z 2025-05-07T19:44:27.4918568Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:27.4918848Z 2025-05-07T19:44:27.4918852Z 2025-05-07T19:44:27.4921744Z 2025-05-07T19:44:27.4952569Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:27.4952957Z 2025-05-07T19:44:27.4952962Z 2025-05-07T19:44:27.4952966Z 2025-05-07T19:44:27.4952970Z 2025-05-07T19:44:27.4964047Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:27.4964941Z 2025-05-07T19:44:27.4964977Z 2025-05-07T19:44:27.4964988Z 2025-05-07T19:44:27.4964999Z 2025-05-07T19:44:27.4965010Z 2025-05-07T19:44:27.4965760Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:27.4966627Z 2025-05-07T19:44:27.4966638Z 2025-05-07T19:44:27.4966648Z 2025-05-07T19:44:27.4966659Z 2025-05-07T19:44:27.4966703Z 2025-05-07T19:44:27.4966714Z 2025-05-07T19:44:27.4967481Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:27.4968362Z 2025-05-07T19:44:27.4968373Z 2025-05-07T19:44:27.4968383Z 2025-05-07T19:44:27.4968393Z 2025-05-07T19:44:27.4968403Z 2025-05-07T19:44:27.4968413Z 2025-05-07T19:44:27.4968610Z 2025-05-07T19:44:27.4968876Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:27.4969181Z 2025-05-07T19:44:27.4969185Z 2025-05-07T19:44:27.4969188Z 2025-05-07T19:44:27.4969192Z 2025-05-07T19:44:27.4969195Z 2025-05-07T19:44:27.4969199Z 2025-05-07T19:44:27.4969202Z 2025-05-07T19:44:27.4969209Z 2025-05-07T19:44:27.4969504Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:27.4969812Z 2025-05-07T19:44:27.4969816Z 2025-05-07T19:44:27.4969819Z 2025-05-07T19:44:27.4969823Z 2025-05-07T19:44:27.4969826Z 2025-05-07T19:44:27.4969830Z 2025-05-07T19:44:27.4969833Z 2025-05-07T19:44:27.4969837Z 2025-05-07T19:44:27.4969840Z 2025-05-07T19:44:27.4990469Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:27.4990952Z 
2025-05-07T19:44:27.4990957Z 2025-05-07T19:44:27.4990961Z 2025-05-07T19:44:27.4990964Z 2025-05-07T19:44:27.4990968Z 2025-05-07T19:44:27.4991035Z 2025-05-07T19:44:27.4991038Z 2025-05-07T19:44:27.4991305Z 2025-05-07T19:44:27.4991308Z 2025-05-07T19:44:27.4991312Z 2025-05-07T19:44:27.4991728Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:27.4992039Z 2025-05-07T19:44:27.4992042Z 2025-05-07T19:44:27.4992046Z 2025-05-07T19:44:27.4992160Z 2025-05-07T19:44:27.4992164Z 2025-05-07T19:44:27.4992168Z 2025-05-07T19:44:27.4992172Z 2025-05-07T19:44:27.4992175Z 2025-05-07T19:44:27.4992178Z 2025-05-07T19:44:27.4992182Z 2025-05-07T19:44:27.4992214Z 2025-05-07T19:44:27.6838046Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:27.6839066Z 2025-05-07T19:44:27.6839070Z 2025-05-07T19:44:27.6839083Z 2025-05-07T19:44:27.6851753Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:27.6852175Z 2025-05-07T19:44:27.6852180Z 2025-05-07T19:44:27.6852184Z 2025-05-07T19:44:27.6852188Z 2025-05-07T19:44:27.7716371Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:27.7716739Z 2025-05-07T19:44:27.7716744Z 2025-05-07T19:44:27.7841130Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:27.7841448Z 2025-05-07T19:44:27.7847744Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:27.7848545Z 2025-05-07T19:44:27.7848558Z 2025-05-07T19:44:27.7848569Z 2025-05-07T19:44:27.7848580Z 2025-05-07T19:44:27.7928905Z libstdcxx-15.1.0 | 3.7 MB | #######1 | 72%  2025-05-07T19:44:27.7929257Z 2025-05-07T19:44:27.7929262Z 2025-05-07T19:44:27.7929266Z 2025-05-07T19:44:27.8154849Z binutils_impl_linux- | 6.0 MB | ###1 | 31%  2025-05-07T19:44:27.8155193Z 2025-05-07T19:44:27.8155324Z 2025-05-07T19:44:27.8155334Z 2025-05-07T19:44:27.8155339Z 2025-05-07T19:44:27.8422091Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:27.8711894Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:27.8712223Z 2025-05-07T19:44:27.8712328Z 2025-05-07T19:44:27.8712356Z 2025-05-07T19:44:27.8712427Z 
2025-05-07T19:44:27.8712554Z 2025-05-07T19:44:27.8713006Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:27.8713313Z 2025-05-07T19:44:27.8714205Z 2025-05-07T19:44:27.8843924Z libstdcxx-devel_linu | 11.1 MB | #######9 | 80%  2025-05-07T19:44:27.8844794Z 2025-05-07T19:44:27.9402436Z gxx_impl_linux-64-11 | 11.2 MB | #######2 | 72%  2025-05-07T19:44:27.9402745Z 2025-05-07T19:44:27.9402749Z 2025-05-07T19:44:27.9402764Z 2025-05-07T19:44:27.9403016Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:27.9403303Z 2025-05-07T19:44:27.9403306Z 2025-05-07T19:44:27.9403310Z 2025-05-07T19:44:27.9427215Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:27.9516894Z gcc_impl_linux-64-11 | 53.0 MB | #3 | 13% 2025-05-07T19:44:27.9517712Z 2025-05-07T19:44:27.9517727Z 2025-05-07T19:44:27.9517738Z 2025-05-07T19:44:27.9517749Z 2025-05-07T19:44:27.9517796Z 2025-05-07T19:44:27.9799052Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:27.9799419Z 2025-05-07T19:44:27.9799424Z 2025-05-07T19:44:27.9799428Z 2025-05-07T19:44:27.9799431Z 2025-05-07T19:44:27.9799435Z 2025-05-07T19:44:27.9799454Z 2025-05-07T19:44:28.0052544Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:28.0053519Z 2025-05-07T19:44:28.0053568Z 2025-05-07T19:44:28.0053579Z 2025-05-07T19:44:28.0053590Z 2025-05-07T19:44:28.0053601Z 2025-05-07T19:44:28.0053612Z 2025-05-07T19:44:28.0053622Z 2025-05-07T19:44:28.0140857Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:28.0141413Z 2025-05-07T19:44:28.0141418Z 2025-05-07T19:44:28.0141449Z 2025-05-07T19:44:28.0141453Z 2025-05-07T19:44:28.0269851Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:28.0270323Z 2025-05-07T19:44:28.0270328Z 2025-05-07T19:44:28.0270333Z 2025-05-07T19:44:28.0270581Z 2025-05-07T19:44:28.0270585Z 2025-05-07T19:44:28.0270588Z 2025-05-07T19:44:28.0270592Z 2025-05-07T19:44:28.0322161Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:28.0322495Z 
2025-05-07T19:44:28.0323311Z 2025-05-07T19:44:28.0427242Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:28.0451978Z gcc_impl_linux-64-11 | 53.0 MB | ##7 | 28% 2025-05-07T19:44:28.0452373Z 2025-05-07T19:44:28.0533017Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:28.0533296Z 2025-05-07T19:44:28.0533301Z 2025-05-07T19:44:28.0533312Z 2025-05-07T19:44:28.0533315Z 2025-05-07T19:44:28.0533319Z 2025-05-07T19:44:28.0533322Z 2025-05-07T19:44:28.0705599Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:28.0706559Z 2025-05-07T19:44:28.0706574Z 2025-05-07T19:44:28.0706586Z 2025-05-07T19:44:28.0706597Z 2025-05-07T19:44:28.0706607Z 2025-05-07T19:44:28.0706652Z 2025-05-07T19:44:28.0706662Z 2025-05-07T19:44:28.0797104Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:28.0798049Z 2025-05-07T19:44:28.0798064Z 2025-05-07T19:44:28.0798075Z 2025-05-07T19:44:28.0798085Z 2025-05-07T19:44:28.0798096Z 2025-05-07T19:44:28.0798139Z 2025-05-07T19:44:28.0798151Z 2025-05-07T19:44:28.0798161Z 2025-05-07T19:44:28.0813204Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:28.0813542Z 2025-05-07T19:44:28.0813546Z 2025-05-07T19:44:28.0813550Z 2025-05-07T19:44:28.0813553Z 2025-05-07T19:44:28.0813557Z 2025-05-07T19:44:28.0813560Z 2025-05-07T19:44:28.0813564Z 2025-05-07T19:44:28.0813567Z 2025-05-07T19:44:28.0904417Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:28.0905372Z 2025-05-07T19:44:28.0905387Z 2025-05-07T19:44:28.0905399Z 2025-05-07T19:44:28.0905409Z 2025-05-07T19:44:28.0905420Z 2025-05-07T19:44:28.0905459Z 2025-05-07T19:44:28.0905503Z 2025-05-07T19:44:28.0905513Z 2025-05-07T19:44:28.0905524Z 2025-05-07T19:44:28.0905535Z 2025-05-07T19:44:28.0917974Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:28.0918283Z 2025-05-07T19:44:28.0918287Z 2025-05-07T19:44:28.0918308Z 2025-05-07T19:44:28.0918334Z 2025-05-07T19:44:28.0918338Z 2025-05-07T19:44:28.0918341Z 
2025-05-07T19:44:28.0918345Z 2025-05-07T19:44:28.0918348Z 2025-05-07T19:44:28.0918351Z 2025-05-07T19:44:28.0918475Z 2025-05-07T19:44:28.0920811Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:28.0921140Z 2025-05-07T19:44:28.0921144Z 2025-05-07T19:44:28.0921150Z 2025-05-07T19:44:28.0921153Z 2025-05-07T19:44:28.0921156Z 2025-05-07T19:44:28.0921160Z 2025-05-07T19:44:28.0921164Z 2025-05-07T19:44:28.0921167Z 2025-05-07T19:44:28.0921193Z 2025-05-07T19:44:28.0921196Z 2025-05-07T19:44:28.0921208Z 2025-05-07T19:44:28.0931591Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:28.0931935Z 2025-05-07T19:44:28.0931939Z 2025-05-07T19:44:28.0931942Z 2025-05-07T19:44:28.0931946Z 2025-05-07T19:44:28.0931949Z 2025-05-07T19:44:28.0931982Z 2025-05-07T19:44:28.0931985Z 2025-05-07T19:44:28.0931994Z 2025-05-07T19:44:28.0931997Z 2025-05-07T19:44:28.0932001Z 2025-05-07T19:44:28.0932009Z 2025-05-07T19:44:28.0944667Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:28.0944993Z 2025-05-07T19:44:28.0945253Z 2025-05-07T19:44:28.0945261Z 2025-05-07T19:44:28.0945266Z 2025-05-07T19:44:28.0945270Z 2025-05-07T19:44:28.0945274Z 2025-05-07T19:44:28.0945289Z 2025-05-07T19:44:28.0945293Z 2025-05-07T19:44:28.0945390Z 2025-05-07T19:44:28.0955572Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:28.0955934Z 2025-05-07T19:44:28.0955939Z 2025-05-07T19:44:28.0955943Z 2025-05-07T19:44:28.0955947Z 2025-05-07T19:44:28.0956165Z 2025-05-07T19:44:28.0956169Z 2025-05-07T19:44:28.0956172Z 2025-05-07T19:44:28.0956176Z 2025-05-07T19:44:28.0956179Z 2025-05-07T19:44:28.1425364Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:28.1426317Z 2025-05-07T19:44:28.1426726Z 2025-05-07T19:44:28.1426741Z 2025-05-07T19:44:28.1426752Z 2025-05-07T19:44:28.1426762Z 2025-05-07T19:44:28.1427569Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:28.1429305Z gcc_impl_linux-64-11 | 53.0 MB | #####2 | 52% 2025-05-07T19:44:28.1430020Z 
2025-05-07T19:44:28.1430031Z 2025-05-07T19:44:28.1430041Z 2025-05-07T19:44:28.1430051Z 2025-05-07T19:44:28.1430062Z 2025-05-07T19:44:28.2435499Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:28.3040678Z gcc_impl_linux-64-11 | 53.0 MB | ######8 | 69% 2025-05-07T19:44:28.3041049Z 2025-05-07T19:44:28.3041212Z 2025-05-07T19:44:28.3041224Z 2025-05-07T19:44:28.3436588Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:28.3680258Z gcc_impl_linux-64-11 | 53.0 MB | #########4 | 94% 2025-05-07T19:44:28.3681109Z 2025-05-07T19:44:28.3681124Z 2025-05-07T19:44:28.3681136Z 2025-05-07T19:44:28.3681146Z 2025-05-07T19:44:28.3681156Z 2025-05-07T19:44:28.3681200Z 2025-05-07T19:44:28.3682186Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:28.3683110Z 2025-05-07T19:44:28.3683122Z 2025-05-07T19:44:28.3683133Z 2025-05-07T19:44:28.3683144Z 2025-05-07T19:44:28.3683154Z 2025-05-07T19:44:28.3683164Z 2025-05-07T19:44:28.3907861Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:28.3908226Z 2025-05-07T19:44:28.3908231Z 2025-05-07T19:44:28.3908234Z 2025-05-07T19:44:28.3908238Z 2025-05-07T19:44:28.3908241Z 2025-05-07T19:44:28.3908245Z 2025-05-07T19:44:28.3908248Z 2025-05-07T19:44:28.3908281Z 2025-05-07T19:44:28.3908555Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:28.3908877Z 2025-05-07T19:44:28.3908881Z 2025-05-07T19:44:28.3908884Z 2025-05-07T19:44:28.3908888Z 2025-05-07T19:44:28.3908891Z 2025-05-07T19:44:28.3908895Z 2025-05-07T19:44:28.3908898Z 2025-05-07T19:44:28.3908901Z 2025-05-07T19:44:28.4157808Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:28.4158731Z 2025-05-07T19:44:28.4164042Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:28.4164652Z 2025-05-07T19:44:28.4164664Z 2025-05-07T19:44:28.4164674Z 2025-05-07T19:44:28.4164685Z 2025-05-07T19:44:28.4164855Z 2025-05-07T19:44:28.4164867Z 2025-05-07T19:44:28.4165008Z 2025-05-07T19:44:28.4165019Z 
2025-05-07T19:44:28.4165068Z 2025-05-07T19:44:28.4165284Z 2025-05-07T19:44:28.4166631Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:28.4167499Z 2025-05-07T19:44:28.4167537Z 2025-05-07T19:44:28.4167547Z 2025-05-07T19:44:28.4167558Z 2025-05-07T19:44:28.4167590Z 2025-05-07T19:44:28.4167601Z 2025-05-07T19:44:28.4167611Z 2025-05-07T19:44:28.4167621Z 2025-05-07T19:44:28.4167631Z 2025-05-07T19:44:28.4167642Z 2025-05-07T19:44:28.4442508Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:28.4443516Z 2025-05-07T19:44:28.4443533Z 2025-05-07T19:44:28.4443544Z 2025-05-07T19:44:28.4443555Z 2025-05-07T19:44:28.4443565Z 2025-05-07T19:44:28.4443576Z 2025-05-07T19:44:28.4443586Z 2025-05-07T19:44:28.4443597Z 2025-05-07T19:44:28.4443607Z 2025-05-07T19:44:28.4443618Z 2025-05-07T19:44:28.4443665Z 2025-05-07T19:44:28.4444517Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:28.4445431Z 2025-05-07T19:44:28.4445442Z 2025-05-07T19:44:28.4445452Z 2025-05-07T19:44:28.4445463Z 2025-05-07T19:44:28.4445474Z 2025-05-07T19:44:28.4445484Z 2025-05-07T19:44:28.4445495Z 2025-05-07T19:44:28.4445505Z 2025-05-07T19:44:28.4445515Z 2025-05-07T19:44:28.4446023Z 2025-05-07T19:44:28.4446035Z 2025-05-07T19:44:28.4476913Z binutils_linux-64-2. 
| 28 KB | ########## | 100%  2025-05-07T19:44:28.4477288Z 2025-05-07T19:44:28.4477293Z 2025-05-07T19:44:28.4477297Z 2025-05-07T19:44:28.4477300Z 2025-05-07T19:44:28.4477606Z 2025-05-07T19:44:28.4477614Z 2025-05-07T19:44:28.4477618Z 2025-05-07T19:44:28.4477622Z 2025-05-07T19:44:28.4477626Z 2025-05-07T19:44:28.4477935Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:28.4478248Z 2025-05-07T19:44:28.4478252Z 2025-05-07T19:44:28.4478256Z 2025-05-07T19:44:28.4478260Z 2025-05-07T19:44:28.4478264Z 2025-05-07T19:44:28.4478297Z 2025-05-07T19:44:28.4478301Z 2025-05-07T19:44:28.4478304Z 2025-05-07T19:44:28.4478318Z 2025-05-07T19:44:28.5842325Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:28.5843269Z 2025-05-07T19:44:28.5843284Z 2025-05-07T19:44:28.5975017Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:29.1348033Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:29.1351077Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:29.1351517Z 2025-05-07T19:44:29.1351742Z 2025-05-07T19:44:29.1352017Z  2025-05-07T19:44:29.1352250Z 2025-05-07T19:44:29.1352254Z 2025-05-07T19:44:29.1352475Z  2025-05-07T19:44:29.1352702Z 2025-05-07T19:44:29.1352706Z 2025-05-07T19:44:29.1352710Z 2025-05-07T19:44:29.1352900Z  2025-05-07T19:44:29.1353154Z 2025-05-07T19:44:29.1353158Z 2025-05-07T19:44:29.1353162Z 2025-05-07T19:44:29.1353165Z 2025-05-07T19:44:29.1353380Z  2025-05-07T19:44:29.1353640Z 2025-05-07T19:44:29.1353644Z 2025-05-07T19:44:29.1353658Z 2025-05-07T19:44:29.1353661Z 2025-05-07T19:44:29.1353665Z 2025-05-07T19:44:29.1353968Z  2025-05-07T19:44:29.1354214Z 2025-05-07T19:44:29.1354248Z 2025-05-07T19:44:29.1354251Z 2025-05-07T19:44:29.1354261Z 2025-05-07T19:44:29.1354265Z 2025-05-07T19:44:29.1354268Z 2025-05-07T19:44:29.1354473Z  2025-05-07T19:44:29.1354709Z 2025-05-07T19:44:29.1354713Z 2025-05-07T19:44:29.1354716Z 2025-05-07T19:44:29.1354720Z 2025-05-07T19:44:29.1354724Z 2025-05-07T19:44:29.1354757Z 
2025-05-07T19:44:29.1354761Z 2025-05-07T19:44:29.1355053Z  2025-05-07T19:44:29.1355293Z 2025-05-07T19:44:29.1355297Z 2025-05-07T19:44:29.1355301Z 2025-05-07T19:44:29.1355304Z 2025-05-07T19:44:29.1355307Z 2025-05-07T19:44:29.1355311Z 2025-05-07T19:44:29.1355314Z 2025-05-07T19:44:29.1355318Z 2025-05-07T19:44:29.1355548Z  2025-05-07T19:44:29.1355793Z 2025-05-07T19:44:29.1355797Z 2025-05-07T19:44:29.1355800Z 2025-05-07T19:44:29.1355803Z 2025-05-07T19:44:29.1355807Z 2025-05-07T19:44:29.1355810Z 2025-05-07T19:44:29.1355818Z 2025-05-07T19:44:29.1355822Z 2025-05-07T19:44:29.1355826Z 2025-05-07T19:44:29.1356055Z  2025-05-07T19:44:29.1356297Z 2025-05-07T19:44:29.1356301Z 2025-05-07T19:44:29.1356304Z 2025-05-07T19:44:29.1356308Z 2025-05-07T19:44:29.1356311Z 2025-05-07T19:44:29.1356315Z 2025-05-07T19:44:29.1356318Z 2025-05-07T19:44:29.1356321Z 2025-05-07T19:44:29.1356325Z 2025-05-07T19:44:29.1356329Z 2025-05-07T19:44:29.1356566Z  2025-05-07T19:44:29.1356810Z 2025-05-07T19:44:29.1356813Z 2025-05-07T19:44:29.1356817Z 2025-05-07T19:44:29.1356820Z 2025-05-07T19:44:29.1357062Z 2025-05-07T19:44:29.1357066Z 2025-05-07T19:44:29.1357070Z 2025-05-07T19:44:29.1357073Z 2025-05-07T19:44:29.1357077Z 2025-05-07T19:44:29.1357080Z 2025-05-07T19:44:29.1357084Z 2025-05-07T19:44:29.1357440Z  done 2025-05-07T19:44:29.2363986Z Preparing transaction: \ done 2025-05-07T19:44:29.9382387Z Verifying transaction: / - \ | / - \ done 2025-05-07T19:44:30.0395211Z Executing transaction: / done 2025-05-07T19:44:30.1306424Z [INSTALL] Setting the C/C++ compiler symlinks ... 
2025-05-07T19:44:33.9100459Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:33.9101149Z 2025-05-07T19:44:33.9113253Z 2025-05-07T19:44:33.9133351Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:33.9134639Z 2025-05-07T19:44:33.9154100Z 2025-05-07T19:44:33.9181551Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:33.9182991Z 2025-05-07T19:44:33.9193163Z 2025-05-07T19:44:33.9212003Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:33.9212655Z 2025-05-07T19:44:33.9231256Z 2025-05-07T19:44:33.9245306Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:33.9269905Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:34.6409537Z Channels: 2025-05-07T19:44:34.6410310Z - conda-forge 2025-05-07T19:44:34.6410639Z Platform: linux-64 2025-05-07T19:44:37.7448202Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:39.1014673Z Solving environment: \ | / done 2025-05-07T19:44:39.1572035Z 2025-05-07T19:44:39.1572358Z ## Package Plan ## 2025-05-07T19:44:39.1572540Z 2025-05-07T19:44:39.1572853Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:39.1573265Z 2025-05-07T19:44:39.1573405Z added / updated specs: 2025-05-07T19:44:39.1573673Z - clangxx=16.0.6 2025-05-07T19:44:39.1573942Z - compiler-rt=16.0.6 2025-05-07T19:44:39.1574205Z - libcxx 2025-05-07T19:44:39.1574423Z - llvm-openmp=16.0.6 2025-05-07T19:44:39.1574582Z 2025-05-07T19:44:39.1574587Z 2025-05-07T19:44:39.1574733Z The following 
packages will be downloaded: 2025-05-07T19:44:39.1574962Z 2025-05-07T19:44:39.1575081Z package | build 2025-05-07T19:44:39.1575438Z ---------------------------|----------------- 2025-05-07T19:44:39.1575949Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:39.1576426Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:39.1577016Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:39.1577450Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:39.1578111Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:39.1578568Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:39.1579067Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:39.1579557Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:39.1580033Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:39.1580514Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:39.1580971Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:39.1581447Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:39.1582173Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:39.1582665Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:39.1583274Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:39.1583701Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:39.1584149Z ------------------------------------------------------------ 2025-05-07T19:44:39.1584633Z Total: 142.6 MB 2025-05-07T19:44:39.1585005Z 2025-05-07T19:44:39.1585144Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:39.1585374Z 2025-05-07T19:44:39.1585640Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:39.1586128Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:39.1586658Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 
2025-05-07T19:44:39.1587160Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:39.1587736Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:39.1588223Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:39.1588749Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:39.1589300Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:39.1589749Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:39.1590242Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:39.1590891Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:39.1591388Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:39.1591889Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:39.1592384Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:39.1596048Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:39.1596328Z 2025-05-07T19:44:39.1596465Z The following packages will be UPDATED: 2025-05-07T19:44:39.1596712Z 2025-05-07T19:44:39.1596980Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:39.1597345Z 2025-05-07T19:44:39.1597350Z 2025-05-07T19:44:39.1597380Z 2025-05-07T19:44:39.1597547Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:39.1597965Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:39.1598260Z 2025-05-07T19:44:39.1598617Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:39.1598885Z 2025-05-07T19:44:39.1598889Z 2025-05-07T19:44:39.1599152Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:39.1599429Z 2025-05-07T19:44:39.1599433Z 2025-05-07T19:44:39.1599437Z 2025-05-07T19:44:39.1599686Z libclang-cpp16-16.0. 
| 17.3 MB | | 0%  2025-05-07T19:44:39.1600004Z 2025-05-07T19:44:39.1600013Z 2025-05-07T19:44:39.1600017Z 2025-05-07T19:44:39.1600020Z 2025-05-07T19:44:39.1618406Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:39.1619267Z 2025-05-07T19:44:39.1619282Z 2025-05-07T19:44:39.1619293Z 2025-05-07T19:44:39.1619304Z 2025-05-07T19:44:39.1619314Z 2025-05-07T19:44:39.1620292Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:39.1620575Z 2025-05-07T19:44:39.1620579Z 2025-05-07T19:44:39.1620583Z 2025-05-07T19:44:39.1620587Z 2025-05-07T19:44:39.1620591Z 2025-05-07T19:44:39.1622860Z 2025-05-07T19:44:39.1624580Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:39.1624907Z 2025-05-07T19:44:39.1625180Z 2025-05-07T19:44:39.1625186Z 2025-05-07T19:44:39.1625230Z 2025-05-07T19:44:39.1625235Z 2025-05-07T19:44:39.1625239Z 2025-05-07T19:44:39.1625318Z 2025-05-07T19:44:39.1626184Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:39.1626598Z 2025-05-07T19:44:39.1626604Z 2025-05-07T19:44:39.1626608Z 2025-05-07T19:44:39.1626628Z 2025-05-07T19:44:39.1626632Z 2025-05-07T19:44:39.1626663Z 2025-05-07T19:44:39.1626667Z 2025-05-07T19:44:39.1626671Z 2025-05-07T19:44:39.1629684Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:39.1629991Z 2025-05-07T19:44:39.1629995Z 2025-05-07T19:44:39.1629999Z 2025-05-07T19:44:39.1630003Z 2025-05-07T19:44:39.1630007Z 2025-05-07T19:44:39.1630040Z 2025-05-07T19:44:39.1630044Z 2025-05-07T19:44:39.1630047Z 2025-05-07T19:44:39.1630051Z 2025-05-07T19:44:39.1630590Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:39.1630912Z 2025-05-07T19:44:39.1630931Z 2025-05-07T19:44:39.1630935Z 2025-05-07T19:44:39.1630940Z 2025-05-07T19:44:39.1630974Z 2025-05-07T19:44:39.1630979Z 2025-05-07T19:44:39.1630982Z 2025-05-07T19:44:39.1630987Z 2025-05-07T19:44:39.1630990Z 2025-05-07T19:44:39.1630994Z 2025-05-07T19:44:39.1634505Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:39.1634819Z 2025-05-07T19:44:39.1634865Z 2025-05-07T19:44:39.1634869Z 2025-05-07T19:44:39.1634873Z 
2025-05-07T19:44:39.1634877Z 2025-05-07T19:44:39.1634880Z 2025-05-07T19:44:39.1634884Z 2025-05-07T19:44:39.1634889Z 2025-05-07T19:44:39.1634893Z 2025-05-07T19:44:39.1634898Z 2025-05-07T19:44:39.1634902Z 2025-05-07T19:44:39.1635794Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:39.1636129Z 2025-05-07T19:44:39.1636134Z 2025-05-07T19:44:39.1636159Z 2025-05-07T19:44:39.1636163Z 2025-05-07T19:44:39.1636167Z 2025-05-07T19:44:39.1636171Z 2025-05-07T19:44:39.1636199Z 2025-05-07T19:44:39.1636204Z 2025-05-07T19:44:39.1636235Z 2025-05-07T19:44:39.1636239Z 2025-05-07T19:44:39.1636243Z 2025-05-07T19:44:39.1636248Z 2025-05-07T19:44:39.1636712Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:39.1637037Z 2025-05-07T19:44:39.1637042Z 2025-05-07T19:44:39.1637046Z 2025-05-07T19:44:39.1637050Z 2025-05-07T19:44:39.1637079Z 2025-05-07T19:44:39.1637096Z 2025-05-07T19:44:39.1637100Z 2025-05-07T19:44:39.1637104Z 2025-05-07T19:44:39.1637108Z 2025-05-07T19:44:39.1637112Z 2025-05-07T19:44:39.1637116Z 2025-05-07T19:44:39.1637120Z 2025-05-07T19:44:39.1637124Z 2025-05-07T19:44:39.1637697Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:39.1638050Z 2025-05-07T19:44:39.1638066Z 2025-05-07T19:44:39.1638070Z 2025-05-07T19:44:39.1638073Z 2025-05-07T19:44:39.1638077Z 2025-05-07T19:44:39.1638081Z 2025-05-07T19:44:39.1638084Z 2025-05-07T19:44:39.1638093Z 2025-05-07T19:44:39.1638096Z 2025-05-07T19:44:39.1638099Z 2025-05-07T19:44:39.1638103Z 2025-05-07T19:44:39.1638106Z 2025-05-07T19:44:39.1638110Z 2025-05-07T19:44:39.1638113Z 2025-05-07T19:44:39.1638809Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:39.1639105Z 2025-05-07T19:44:39.1639120Z 2025-05-07T19:44:39.1639123Z 2025-05-07T19:44:39.1639127Z 2025-05-07T19:44:39.1639130Z 2025-05-07T19:44:39.1639134Z 2025-05-07T19:44:39.1639137Z 2025-05-07T19:44:39.1639140Z 2025-05-07T19:44:39.1639144Z 2025-05-07T19:44:39.1639148Z 2025-05-07T19:44:39.1639152Z 2025-05-07T19:44:39.1639156Z 2025-05-07T19:44:39.1639159Z 2025-05-07T19:44:39.1639162Z 
2025-05-07T19:44:39.1639166Z 2025-05-07T19:44:39.5775417Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:39.5776405Z 2025-05-07T19:44:39.5776420Z 2025-05-07T19:44:39.5833649Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:39.5834367Z 2025-05-07T19:44:39.5879520Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:39.5880359Z 2025-05-07T19:44:39.5880373Z 2025-05-07T19:44:39.5880384Z 2025-05-07T19:44:39.6104769Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:39.6171716Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:39.6172554Z 2025-05-07T19:44:39.6172569Z 2025-05-07T19:44:39.6172581Z 2025-05-07T19:44:39.6172593Z 2025-05-07T19:44:39.6776354Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:39.6777178Z 2025-05-07T19:44:39.6777193Z 2025-05-07T19:44:39.6834871Z libllvm16-16.0.6 | 33.7 MB | ##7 | 27%  2025-05-07T19:44:39.6835730Z 2025-05-07T19:44:39.6880833Z compiler-rt_linux-64 | 36.0 MB | ##6 | 26%  2025-05-07T19:44:39.6881151Z 2025-05-07T19:44:39.6881157Z 2025-05-07T19:44:39.6881172Z 2025-05-07T19:44:39.7104506Z libclang-cpp16-16.0. | 17.3 MB | ######1 | 62%  2025-05-07T19:44:39.7173631Z llvm-openmp-16.0.6 | 39.9 MB | 7 | 7% 2025-05-07T19:44:39.7175207Z 2025-05-07T19:44:39.7175223Z 2025-05-07T19:44:39.7175236Z 2025-05-07T19:44:39.7175247Z 2025-05-07T19:44:39.7804036Z icu-73.2 | 11.5 MB | ########4 | 85%  2025-05-07T19:44:39.7804355Z 2025-05-07T19:44:39.7804360Z 2025-05-07T19:44:39.7916510Z libllvm16-16.0.6 | 33.7 MB | #####4 | 55%  2025-05-07T19:44:39.7916816Z 2025-05-07T19:44:39.8097053Z compiler-rt_linux-64 | 36.0 MB | #####1 | 52%  2025-05-07T19:44:39.8097441Z 2025-05-07T19:44:39.8097667Z 2025-05-07T19:44:39.8097671Z 2025-05-07T19:44:39.8119803Z libclang-cpp16-16.0. 
| 17.3 MB | #########8 | 98%  2025-05-07T19:44:39.8132024Z llvm-openmp-16.0.6 | 39.9 MB | ###4 | 35% 2025-05-07T19:44:39.8132310Z 2025-05-07T19:44:39.8132315Z 2025-05-07T19:44:39.8132327Z 2025-05-07T19:44:39.8132333Z 2025-05-07T19:44:39.8662555Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:39.8663026Z 2025-05-07T19:44:39.8663160Z 2025-05-07T19:44:39.8663166Z 2025-05-07T19:44:39.8663195Z 2025-05-07T19:44:39.8663198Z 2025-05-07T19:44:39.8908666Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:39.8909570Z 2025-05-07T19:44:39.8909586Z 2025-05-07T19:44:39.8909598Z 2025-05-07T19:44:39.8909609Z 2025-05-07T19:44:39.8909620Z 2025-05-07T19:44:39.8920929Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:39.8921742Z 2025-05-07T19:44:39.8931573Z compiler-rt_linux-64 | 36.0 MB | #######7 | 77%  2025-05-07T19:44:39.8932374Z 2025-05-07T19:44:39.8932385Z 2025-05-07T19:44:39.9315548Z libllvm16-16.0.6 | 33.7 MB | #######5 | 76%  2025-05-07T19:44:39.9405175Z llvm-openmp-16.0.6 | 39.9 MB | ####9 | 50% 2025-05-07T19:44:39.9406002Z 2025-05-07T19:44:39.9406017Z 2025-05-07T19:44:39.9406029Z 2025-05-07T19:44:39.9406070Z 2025-05-07T19:44:39.9406081Z 2025-05-07T19:44:39.9406091Z 2025-05-07T19:44:39.9634576Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:39.9634989Z 2025-05-07T19:44:39.9635542Z 2025-05-07T19:44:39.9635713Z 2025-05-07T19:44:39.9635736Z 2025-05-07T19:44:39.9635771Z 2025-05-07T19:44:39.9635833Z 2025-05-07T19:44:39.9996974Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:39.9997295Z 2025-05-07T19:44:40.0111647Z compiler-rt_linux-64 | 36.0 MB | #########8 | 99%  2025-05-07T19:44:40.0112537Z 2025-05-07T19:44:40.0112552Z 2025-05-07T19:44:40.0112564Z 2025-05-07T19:44:40.0112574Z 2025-05-07T19:44:40.0112585Z 2025-05-07T19:44:40.0118827Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:40.0119683Z 2025-05-07T19:44:40.0119695Z 2025-05-07T19:44:40.0119705Z 2025-05-07T19:44:40.0119716Z 2025-05-07T19:44:40.0119727Z 
2025-05-07T19:44:40.0156034Z libcxx-19.1.7 | 1000 KB | ########## | 100% 2025-05-07T19:44:40.0207664Z libiconv-1.18 | 696 KB | 2 | 2% 2025-05-07T19:44:40.0316014Z libllvm16-16.0.6 | 33.7 MB | #########6 | 96% 2025-05-07T19:44:40.0388872Z llvm-openmp-16.0.6 | 39.9 MB | ######5 | 66% 2025-05-07T19:44:40.0879194Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:40.0975571Z libxml2-2.12.7 | 688 KB | 2 | 2% 2025-05-07T19:44:40.1095018Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100% 2025-05-07T19:44:40.1316044Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:40.1439828Z llvm-openmp-16.0.6 | 39.9 MB | ########5 | 85% 2025-05-07T19:44:40.1444030Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:40.1530620Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:40.1705817Z zstd-1.5.6 | 542 KB | 2 | 3% 2025-05-07T19:44:40.1796611Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:40.1855154Z libcxxabi-19.1.7 | 158 KB | # | 10%
2025-05-07T19:44:40.2164529Z libcxxabi-19.1.7 | 158 KB | ########## | 100% 2025-05-07T19:44:40.2201746Z clang-16.0.6 | 110 KB | #4 | 15% 2025-05-07T19:44:40.2404358Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:40.2442950Z clangxx-16.0.6 | 110 KB | #4 | 15% 2025-05-07T19:44:40.2836138Z clangxx-16.0.6 | 110 KB | ########## | 100%
2025-05-07T19:44:40.2863027Z compiler-rt-16.0.6 | 107 KB | #4 | 15% 2025-05-07T19:44:40.3023826Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:40.3042027Z zlib-1.2.13 | 91 KB | #7 | 18% 2025-05-07T19:44:40.3443337Z zlib-1.2.13 | 91 KB | ########## | 100%
2025-05-07T19:44:40.3471399Z libzlib-1.2.13 | 60 KB | ##6 | 27% 2025-05-07T19:44:40.3668808Z libzlib-1.2.13 | 60 KB | ########## | 100% 2025-05-07T19:44:40.3669632Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:40.4311292Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:40.4389146Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:40.4389814Z libxml2-2.12.7 | 688 KB | ########## | 100%
2025-05-07T19:44:40.4496642Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:40.4647031Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:40.4648763Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:40.4759740Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:40.4849302Z icu-73.2 | 11.5 MB | ########## | 100% 2025-05-07T19:44:40.4851139Z libcxxabi-19.1.7 | 158 KB | ########## | 100% 2025-05-07T19:44:40.5459552Z libcxxabi-19.1.7 | 158 KB | ########## | 100%
2025-05-07T19:44:40.5461254Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:40.5576862Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:40.5578794Z clangxx-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:40.5655733Z clangxx-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:40.5730260Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:40.5731120Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:40.5741887Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:40.5743442Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:40.5849624Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:40.5851345Z libzlib-1.2.13 | 60 KB | ########## | 100%
2025-05-07T19:44:40.6697917Z libzlib-1.2.13 | 60 KB | ########## | 100% 2025-05-07T19:44:40.9819968Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100% 2025-05-07T19:44:41.0365744Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:41.1371994Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:41.1375873Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100%
2025-05-07T19:44:41.1385224Z done 2025-05-07T19:44:41.2389954Z Preparing transaction: done 2025-05-07T19:44:41.3397071Z Verifying transaction: done 2025-05-07T19:44:41.4414022Z Executing transaction: done 2025-05-07T19:44:41.5338108Z [INSTALL] Setting the C/C++ compiler symlinks ...
2025-05-07T19:44:45.3032969Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:45.3033579Z 2025-05-07T19:44:45.3055710Z 2025-05-07T19:44:45.3076985Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:45.3077545Z 2025-05-07T19:44:45.3099712Z 2025-05-07T19:44:45.3118218Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:45.3118751Z 2025-05-07T19:44:45.3143102Z 2025-05-07T19:44:45.3159944Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:45.3160468Z 2025-05-07T19:44:45.3184549Z 2025-05-07T19:44:45.3184974Z [INSTALL] Removing GCC package activation scripts ... 2025-05-07T19:44:47.2161123Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:44:47.2162191Z 2025-05-07T19:44:47.2178944Z total 28 2025-05-07T19:44:47.2180108Z drwxr-xr-x. 2 root root 134 May 7 19:44 . 2025-05-07T19:44:47.2180569Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:44:47.2181021Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:44:47.2181676Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:44:47.2182141Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:44:47.2182602Z -rw-r--r--. 
2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:44:47.2182875Z 2025-05-07T19:44:47.2183309Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gcc_linux-64.sh 2025-05-07T19:44:47.2183745Z 2025-05-07T19:44:47.2195479Z 2025-05-07T19:44:47.2196769Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gxx_linux-64.sh 2025-05-07T19:44:47.2198434Z 2025-05-07T19:44:47.2209476Z 2025-05-07T19:44:47.2210065Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:47.2210773Z 2025-05-07T19:44:47.6395557Z 2025-05-07T19:44:47.6396791Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:47.6397565Z 2025-05-07T19:44:48.0603660Z 2025-05-07T19:44:48.0604672Z + conda run -n build_binary printenv CC 2025-05-07T19:44:48.0605482Z 2025-05-07T19:44:49.6515228Z 2025-05-07T19:44:49.6515483Z 2025-05-07T19:44:49.7104007Z 2025-05-07T19:44:49.7104919Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:49.7105223Z 2025-05-07T19:44:51.2975852Z 2025-05-07T19:44:51.2975879Z 2025-05-07T19:44:51.3554712Z 2025-05-07T19:44:53.0193199Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:44:54.6079040Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. 
(See above for error) 2025-05-07T19:44:54.6660078Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:44:54.6661438Z 2025-05-07T19:44:55.0773881Z 2025-05-07T19:44:56.6656475Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:56.7239780Z 2025-05-07T19:44:56.7240293Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:58.3125910Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:58.3126714Z 2025-05-07T19:44:58.3700942Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:59.9558953Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:59.9559756Z 2025-05-07T19:45:00.0139295Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:01.5976322Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:01.5976691Z 2025-05-07T19:45:01.6789249Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:01.6792599Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:01.6794302Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:01.6794956Z 2025-05-07T19:45:03.2887070Z #define _LP64 1 2025-05-07T19:45:03.2887478Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:03.2887981Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:03.2888297Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:03.2888614Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:03.2888925Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:03.2889209Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:03.2889535Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:03.2889871Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:03.2890346Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:03.2890665Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:03.2891054Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:03.2891380Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:03.2891889Z #define __CHAR_BIT__ 8 2025-05-07T19:45:03.2892317Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 
2025-05-07T19:45:03.2892981Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:03.2893362Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:03.2893699Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:03.2894059Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:03.2894553Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:03.2894928Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:03.2895269Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:03.2895647Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:03.2896021Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:03.2896360Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:03.2896696Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:03.2897012Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:03.2897397Z #define __DBL_DIG__ 15 2025-05-07T19:45:03.2897687Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:03.2898062Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:03.2898366Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:03.2898700Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.2898997Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:03.2899424Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:03.2899733Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:03.2900017Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:03.2900357Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:03.2900642Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:03.2900946Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:03.2901265Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:03.2901592Z #define __ELF__ 1 2025-05-07T19:45:03.2901827Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:03.2902124Z #define __FLOAT128__ 1 2025-05-07T19:45:03.2902379Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:03.2902717Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:03.2903084Z 
#define __FLT16_DIG__ 3 2025-05-07T19:45:03.2903344Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:03.2903694Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:03.2903982Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:03.2904296Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.2904590Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:03.2904896Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:03.2905165Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:03.2905467Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:03.2905783Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:03.2906076Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:03.2906385Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:03.2906681Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:03.2906993Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:03.2907293Z #define __FLT_DIG__ 6 2025-05-07T19:45:03.2907577Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:03.2907881Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:03.2908178Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:03.2908460Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.2908764Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:03.2909058Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:03.2909332Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:03.2909625Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:03.2909916Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:03.2910227Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:03.2910500Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:03.2910809Z #define __FLT_RADIX__ 2 2025-05-07T19:45:03.2911057Z #define __FXSR__ 1 2025-05-07T19:45:03.2911332Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:03.2911631Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:03.2911969Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:03.2912315Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:03.2912627Z #define 
__GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:03.2912955Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:03.2913260Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:03.2913702Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:03.2914133Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:03.2914668Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:03.2915083Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:03.2915462Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:03.2915794Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:03.2916150Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:03.2916544Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:03.2916898Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:03.2917281Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:03.2917616Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:03.2917930Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:03.2918232Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:03.2918540Z #define __GNUC__ 4 2025-05-07T19:45:03.2918797Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:03.2919117Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:03.2919406Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:03.2919717Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:03.2920034Z #define __INT16_MAX__ 32767 2025-05-07T19:45:03.2920313Z #define __INT16_TYPE__ short 2025-05-07T19:45:03.2920625Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:03.2920897Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:03.2921200Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:03.2921479Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:03.2921791Z #define __INT32_TYPE__ int 2025-05-07T19:45:03.2922072Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:03.2922383Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:03.2922667Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:03.2922990Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2923345Z 
#define __INT64_TYPE__ long int 2025-05-07T19:45:03.2923641Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:03.2923943Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:03.2924229Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:03.2924549Z #define __INT8_MAX__ 127 2025-05-07T19:45:03.2924835Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:03.2925180Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:03.2925482Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:03.2925813Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:03.2926117Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2926484Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:03.2926939Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:03.2927211Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:03.2927605Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:03.2927933Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2928253Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:03.2929121Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:03.2929413Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:03.2929769Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:03.2930076Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:03.2930420Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:03.2930766Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:03.2931073Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:03.2931407Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:03.2931706Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:03.2932057Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:03.2932355Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:03.2932679Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:03.2932982Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:03.2933334Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2933683Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:03.2934024Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:03.2934351Z #define __INT_FAST8_FMTd__ 
"hhd" 2025-05-07T19:45:03.2934649Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:03.2934982Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:03.2935285Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:03.2935632Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:03.2936116Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:03.2936461Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:03.2936773Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:03.2937119Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:03.2937543Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:03.2937882Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:03.2938214Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:03.2938520Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:03.2938869Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:03.2939173Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:03.2939505Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:03.2939812Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:03.2940165Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2940521Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:03.2940870Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:03.2941319Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:03.2941735Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:03.2942045Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:03.2942328Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:03.2942657Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:03.2942933Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:03.2943225Z #define __INT_WIDTH__ 32 2025-05-07T19:45:03.2943482Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:03.2943831Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:03.2944175Z #define __LDBL_DIG__ 18 2025-05-07T19:45:03.2944483Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:03.2944847Z #define __LDBL_HAS_DENORM__ 1 
2025-05-07T19:45:03.2945125Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:03.2945429Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.2945712Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:03.2946005Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:03.2946283Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:03.2946614Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:03.2946951Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:03.2947272Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:03.2947578Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:03.2947931Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:03.2948220Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:03.2948506Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:03.2948857Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2949162Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:03.2949436Z #define __LP64__ 1 2025-05-07T19:45:03.2949667Z #define __MMX__ 1 2025-05-07T19:45:03.2949920Z #define __NO_INLINE__ 1 2025-05-07T19:45:03.2950168Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:03.2950462Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:03.2950765Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:03.2951129Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:03.2951483Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:03.2951819Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:03.2952175Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:03.2952491Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:03.2952821Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:03.2953124Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:03.2953441Z #define __PIC__ 2 2025-05-07T19:45:03.2953669Z #define __PIE__ 2 2025-05-07T19:45:03.2954025Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:03.2954314Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:03.2954834Z #define __PTRDIFF_FMTd__ "ld" 
2025-05-07T19:45:03.2955272Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:03.2955585Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:03.2955940Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:03.2956272Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:03.2956593Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:03.2956880Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:03.2957323Z #define __SEG_FS 1 2025-05-07T19:45:03.2957577Z #define __SEG_GS 1 2025-05-07T19:45:03.2957865Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:03.2958149Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:03.2958471Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:03.2960185Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:03.2960496Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:03.2960828Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:03.2961129Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:03.2961445Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:03.2961739Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:03.2962063Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:03.2962384Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:03.2962721Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:03.2963005Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:03.2963334Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:03.2963665Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:03.2963952Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:03.2964283Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:03.2964581Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:03.2964904Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:03.2965187Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:03.2965496Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:03.2965767Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:03.2966085Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2966422Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:03.2966888Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:03.2967172Z #define __SSE2_MATH__ 1 
2025-05-07T19:45:03.2967411Z #define __SSE2__ 1 2025-05-07T19:45:03.2967663Z #define __SSE_MATH__ 1 2025-05-07T19:45:03.2967903Z #define __SSE__ 1 2025-05-07T19:45:03.2968170Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:03.2968427Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:03.2968714Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:03.2968978Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:03.2969283Z #define __STDC__ 1 2025-05-07T19:45:03.2969531Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:03.2969829Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:03.2970110Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:03.2970408Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:03.2970703Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:03.2970970Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:03.2971271Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:03.2971574Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:03.2971874Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:03.2972146Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:03.2972432Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:03.2972693Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:03.2972983Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:03.2973278Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:03.2973602Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:03.2973904Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:03.2974174Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:03.2974476Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:03.2974743Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:03.2975050Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2975373Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:03.2975708Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:03.2975972Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:03.2976264Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:03.2976529Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:03.2976822Z #define __UINT8_FMTx__ "hhx" 
2025-05-07T19:45:03.2977110Z #define __UINT8_MAX__ 255 2025-05-07T19:45:03.2977379Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:03.2977702Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:03.2977980Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:03.2978280Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:03.2978557Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:03.2978856Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:03.2979149Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2979629Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:03.2979940Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:03.2980237Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:03.2980537Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:03.2980874Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:03.2981173Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:03.2981460Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2981816Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:03.2982127Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:03.2982424Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:03.2982713Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:03.2983022Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:03.2983317Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:03.2983625Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:03.2983951Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:03.2984267Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:03.2984590Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:03.2984872Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:03.2985184Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:03.2985474Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:03.2985823Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:03.2986141Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:03.2986453Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:03.2986762Z 
#define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:03.2987046Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:03.2987380Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2987731Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:03.2988082Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:03.2988368Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:03.2988678Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:03.2988970Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:03.2989281Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:03.2989581Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:03.2989922Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:03.2990251Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:03.2990548Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:03.2990867Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:03.2991163Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:03.2991506Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:03.2991830Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:03.2992144Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:03.2992435Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:03.2992753Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:03.2993054Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:03.2993396Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:03.2993748Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:03.2994156Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:03.2994678Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:03.2994990Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:03.2995383Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.2995771Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:03.2996159Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:03.2996472Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:03.2996803Z 
#define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:03.2997133Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:03.2997433Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:03.2997767Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:03.2998100Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:03.2998789Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:03.2999443Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:03.2999768Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:03.3000152Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:03.3000463Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:03.3000795Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:03.3001105Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:03.3001423Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:03.3001765Z #define __amd64 1 2025-05-07T19:45:03.3002040Z #define __amd64__ 1 2025-05-07T19:45:03.3002294Z #define __clang__ 1 2025-05-07T19:45:03.3002603Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:03.3002940Z #define __clang_major__ 16 2025-05-07T19:45:03.3003245Z #define __clang_minor__ 0 2025-05-07T19:45:03.3003531Z #define __clang_patchlevel__ 6 2025-05-07T19:45:03.3004180Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:03.3004890Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:03.3005249Z #define __code_model_small__ 1 2025-05-07T19:45:03.3005561Z #define __gnu_linux__ 1 2025-05-07T19:45:03.3005827Z #define __k8 1 2025-05-07T19:45:03.3006090Z #define __k8__ 1 2025-05-07T19:45:03.3006334Z #define __linux 1 2025-05-07T19:45:03.3006602Z #define __linux__ 1 2025-05-07T19:45:03.3006955Z #define __llvm__ 1 2025-05-07T19:45:03.3007207Z #define __pic__ 2 2025-05-07T19:45:03.3007434Z #define __pie__ 2 2025-05-07T19:45:03.3007731Z #define __seg_fs 
__attribute__((address_space(257))) 2025-05-07T19:45:03.3008136Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:03.3008467Z #define __tune_k8__ 1 2025-05-07T19:45:03.3008738Z #define __unix 1 2025-05-07T19:45:03.3008964Z #define __unix__ 1 2025-05-07T19:45:03.3009218Z #define __x86_64 1 2025-05-07T19:45:03.3009449Z #define __x86_64__ 1 2025-05-07T19:45:03.3009708Z #define linux 1 2025-05-07T19:45:03.3009933Z #define unix 1 2025-05-07T19:45:03.3010091Z 2025-05-07T19:45:03.3485616Z 2025-05-07T19:45:03.3486681Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:03.3488089Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:03.3488832Z 2025-05-07T19:45:04.9503979Z #define _GNU_SOURCE 1 2025-05-07T19:45:04.9504450Z #define _LP64 1 2025-05-07T19:45:04.9505524Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:04.9506007Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:04.9506321Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:04.9506609Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:04.9506868Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:04.9507133Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:04.9507397Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:04.9507700Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:04.9507990Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:04.9508289Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:04.9508626Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:04.9508946Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:04.9509245Z #define __CHAR_BIT__ 8 2025-05-07T19:45:04.9509492Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:04.9509827Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:04.9510177Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:04.9510509Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:04.9510820Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:04.9511153Z #define 
__CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:04.9511495Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:04.9511826Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:04.9512148Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:04.9512482Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:04.9512811Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:04.9513099Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:04.9513419Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:04.9513736Z #define __DBL_DIG__ 15 2025-05-07T19:45:04.9514171Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:04.9514490Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:04.9514779Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:04.9515389Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:04.9515798Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:04.9516075Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:04.9516344Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:04.9516760Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:04.9517069Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:04.9517371Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:04.9517651Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:04.9517991Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:04.9518305Z #define __DEPRECATED 1 2025-05-07T19:45:04.9518554Z #define __ELF__ 1 2025-05-07T19:45:04.9518777Z #define __EXCEPTIONS 1 2025-05-07T19:45:04.9519038Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:04.9519315Z #define __FLOAT128__ 1 2025-05-07T19:45:04.9519546Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:04.9519860Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:04.9520184Z #define __FLT16_DIG__ 3 2025-05-07T19:45:04.9520444Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:04.9520835Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:04.9521101Z #define __FLT16_HAS_INFINITY__ 1 
2025-05-07T19:45:04.9521358Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:04.9521625Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:04.9521867Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:04.9522122Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:04.9522382Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:04.9522640Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:04.9522908Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:04.9523157Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:04.9523440Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:04.9523697Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:04.9523984Z #define __FLT_DIG__ 6 2025-05-07T19:45:04.9524209Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:04.9524490Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:04.9524735Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:04.9524996Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:04.9525254Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:04.9525489Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:04.9525747Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:04.9525982Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:04.9526259Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:04.9526511Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:04.9526769Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:04.9527020Z #define __FLT_RADIX__ 2 2025-05-07T19:45:04.9527254Z #define __FXSR__ 1 2025-05-07T19:45:04.9527467Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:04.9527747Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:04.9528041Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:04.9528504Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:04.9528989Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:04.9529392Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:04.9529703Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:04.9530001Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 
2025-05-07T19:45:04.9530323Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:04.9530635Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:04.9530959Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:04.9531290Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:04.9531593Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:04.9531909Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:04.9532235Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:04.9532576Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:04.9532900Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:04.9533229Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:04.9533525Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:04.9533833Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:04.9534212Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:04.9534479Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:04.9534746Z #define __GNUC__ 4 2025-05-07T19:45:04.9534954Z #define __GNUG__ 4 2025-05-07T19:45:04.9535199Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:04.9535680Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:04.9535955Z #define __GXX_RTTI 1 2025-05-07T19:45:04.9536164Z #define __GXX_WEAK__ 1 2025-05-07T19:45:04.9536396Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:04.9536627Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:04.9536877Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:04.9537103Z #define __INT16_MAX__ 32767 2025-05-07T19:45:04.9537350Z #define __INT16_TYPE__ short 2025-05-07T19:45:04.9537612Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:04.9537849Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:04.9538096Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:04.9538327Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:04.9538593Z #define __INT32_TYPE__ int 2025-05-07T19:45:04.9538827Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:04.9539074Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:04.9539304Z #define 
__INT64_FMTi__ "li" 2025-05-07T19:45:04.9539561Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9539833Z #define __INT64_TYPE__ long int 2025-05-07T19:45:04.9540093Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:04.9540339Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:04.9540570Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:04.9540818Z #define __INT8_MAX__ 127 2025-05-07T19:45:04.9541049Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:04.9541323Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:04.9541575Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:04.9541830Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:04.9542081Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9542378Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:04.9542631Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:04.9542888Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:04.9543136Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:04.9543408Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9543707Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:04.9543962Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:04.9544226Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:04.9544481Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:04.9544746Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:04.9545001Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:04.9545277Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:04.9545521Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:04.9545779Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:04.9546048Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:04.9546319Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:04.9546582Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:04.9546829Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:04.9547091Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:04.9547364Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9547679Z #define __INT_FAST64_TYPE__ long 
int 2025-05-07T19:45:04.9547939Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:04.9548196Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:04.9548443Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:04.9548707Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:04.9548970Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:04.9567081Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:04.9567419Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:04.9567716Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:04.9567977Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:04.9568290Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:04.9568558Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:04.9568846Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:04.9569114Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:04.9569389Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:04.9569664Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:04.9570078Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:04.9570357Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:04.9570618Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:04.9570915Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9571298Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:04.9571590Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:04.9571844Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:04.9572118Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:04.9572377Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:04.9572649Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:04.9572951Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:04.9573200Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:04.9573452Z #define __INT_WIDTH__ 32 2025-05-07T19:45:04.9573690Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:04.9573998Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:04.9574315Z #define __LDBL_DIG__ 18 2025-05-07T19:45:04.9574590Z 
#define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:04.9574892Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:04.9575152Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:04.9575405Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:04.9575670Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:04.9575927Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:04.9576178Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:04.9576462Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:04.9576763Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:04.9577043Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:04.9577322Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:04.9577633Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:04.9577870Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:04.9578143Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:04.9578441Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9578734Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:04.9578975Z #define __LP64__ 1 2025-05-07T19:45:04.9579174Z #define __MMX__ 1 2025-05-07T19:45:04.9579390Z #define __NO_INLINE__ 1 2025-05-07T19:45:04.9579618Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:04.9579875Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:04.9580148Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:04.9580478Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:04.9580765Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:04.9581074Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:04.9581372Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:04.9581669Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:04.9581946Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:04.9582219Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:04.9582481Z #define __PIC__ 2 2025-05-07T19:45:04.9582678Z #define __PIE__ 2 2025-05-07T19:45:04.9582906Z #define __POINTER_WIDTH__ 64 
2025-05-07T19:45:04.9583168Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:04.9583449Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:04.9583696Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:04.9583977Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:04.9584266Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:04.9584537Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:04.9584798Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:04.9585040Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:04.9585275Z #define __SEG_FS 1 2025-05-07T19:45:04.9585474Z #define __SEG_GS 1 2025-05-07T19:45:04.9585695Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:04.9585925Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:04.9586181Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:04.9586449Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:04.9586711Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:04.9586951Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:04.9587208Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:04.9587455Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:04.9587796Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:04.9588045Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:04.9588301Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:04.9588558Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:04.9588847Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:04.9589107Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:04.9589350Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:04.9589598Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:04.9589835Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:04.9590086Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:04.9590338Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:04.9590572Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:04.9590820Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:04.9591051Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:04.9591311Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9591599Z #define __SIZE_TYPE__ long unsigned int 
2025-05-07T19:45:04.9591895Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:04.9592124Z #define __SSE2_MATH__ 1 2025-05-07T19:45:04.9592352Z #define __SSE2__ 1 2025-05-07T19:45:04.9592591Z #define __SSE_MATH__ 1 2025-05-07T19:45:04.9592861Z #define __SSE__ 1 2025-05-07T19:45:04.9593128Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:04.9593484Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:04.9593787Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:04.9594138Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:04.9594599Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:04.9594871Z #define __STDC__ 1 2025-05-07T19:45:04.9595197Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:04.9595491Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:04.9595819Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:04.9596108Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:04.9596427Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:04.9596716Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:04.9597041Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:04.9597394Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:04.9597660Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:04.9597943Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:04.9598220Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:04.9598533Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:04.9598818Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:04.9599166Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:04.9599488Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:04.9599808Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:04.9600098Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:04.9600418Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:04.9600732Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:04.9601037Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9601416Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:04.9601747Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:04.9602027Z #define __UINT8_FMTX__ "hhX" 
2025-05-07T19:45:04.9602282Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:04.9602549Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:04.9602805Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:04.9603069Z #define __UINT8_MAX__ 255 2025-05-07T19:45:04.9603326Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:04.9603627Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:04.9603907Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:04.9604179Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:04.9604453Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:04.9604712Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:04.9605014Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9605340Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:04.9605659Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:04.9605922Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:04.9606201Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:04.9606466Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:04.9606846Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:04.9607123Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9607427Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:04.9609594Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:04.9609844Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:04.9610121Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:04.9610378Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:04.9610716Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:04.9610968Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:04.9611251Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:04.9611550Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:04.9611820Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:04.9612089Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:04.9612343Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:04.9612614Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:04.9612900Z #define 
__UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:04.9613199Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:04.9613454Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:04.9613725Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:04.9613987Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:04.9614280Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9614621Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:04.9614927Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:04.9615197Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:04.9615452Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:04.9615726Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:04.9615981Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:04.9616254Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:04.9616540Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:04.9616819Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:04.9617079Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:04.9617352Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:04.9617621Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:04.9617894Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:04.9618198Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:04.9618458Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:04.9618724Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:04.9618975Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:04.9619258Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:04.9619547Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:04.9619845Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:04.9620121Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:04.9620380Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:04.9620652Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:04.9620935Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:04.9621280Z #define 
__UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:04.9621581Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:04.9621862Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:04.9622119Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:04.9622402Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:04.9622662Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:04.9622936Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:04.9623225Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:04.9623820Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:04.9624418Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:04.9624670Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:04.9624919Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:04.9625149Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:04.9625414Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:04.9625671Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:04.9625918Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:04.9626139Z #define __amd64 1 2025-05-07T19:45:04.9626352Z #define __amd64__ 1 2025-05-07T19:45:04.9626550Z #define __clang__ 1 2025-05-07T19:45:04.9626789Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:04.9627164Z #define __clang_major__ 16 2025-05-07T19:45:04.9627395Z #define __clang_minor__ 0 2025-05-07T19:45:04.9627648Z #define __clang_patchlevel__ 6 2025-05-07T19:45:04.9628265Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:04.9629378Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:04.9629704Z #define __code_model_small__ 1 2025-05-07T19:45:04.9629985Z #define __cplusplus 201703L 2025-05-07T19:45:04.9630251Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:04.9630566Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:04.9630881Z #define __cpp_alias_templates 200704L 
2025-05-07T19:45:04.9631174Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:04.9631469Z #define __cpp_attributes 200809L 2025-05-07T19:45:04.9631754Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:04.9632063Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:04.9632363Z #define __cpp_constexpr 201603L 2025-05-07T19:45:04.9632675Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:04.9632987Z #define __cpp_decltype 200707L 2025-05-07T19:45:04.9633279Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:04.9633593Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:04.9633976Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:04.9634326Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:04.9634639Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:04.9634967Z #define __cpp_exceptions 199711L 2025-05-07T19:45:04.9635251Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:04.9635568Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:04.9635888Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:04.9636225Z #define __cpp_hex_float 201603L 2025-05-07T19:45:04.9636498Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:04.9636822Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:04.9637174Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:04.9637504Z #define __cpp_init_captures 201304L 2025-05-07T19:45:04.9637815Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:04.9638119Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:04.9638426Z #define __cpp_lambdas 200907L 2025-05-07T19:45:04.9638718Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:04.9639066Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:04.9639411Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:04.9639787Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:04.9640130Z #define __cpp_nontype_template_args 201411L 
2025-05-07T19:45:04.9640485Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:04.9640846Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:04.9641110Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:04.9641414Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:04.9641701Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:04.9642020Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:04.9642328Z #define __cpp_rtti 199711L 2025-05-07T19:45:04.9642606Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:04.9642909Z #define __cpp_static_assert 201411L 2025-05-07T19:45:04.9643230Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:04.9643570Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:04.9643888Z #define __cpp_template_auto 201606L 2025-05-07T19:45:04.9644212Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:04.9644537Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:04.9644869Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:04.9645185Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:04.9645521Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:04.9645836Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:04.9646246Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:04.9646522Z #define __gnu_linux__ 1 2025-05-07T19:45:04.9646736Z #define __k8 1 2025-05-07T19:45:04.9647083Z #define __k8__ 1 2025-05-07T19:45:04.9647275Z #define __linux 1 2025-05-07T19:45:04.9647490Z #define __linux__ 1 2025-05-07T19:45:04.9647685Z #define __llvm__ 1 2025-05-07T19:45:04.9647898Z #define __pic__ 2 2025-05-07T19:45:04.9648093Z #define __pie__ 2 2025-05-07T19:45:04.9648390Z #define __private_extern__ extern 2025-05-07T19:45:04.9648694Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:04.9649064Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:04.9649386Z #define __tune_k8__ 1 2025-05-07T19:45:04.9649593Z #define 
__unix 1 2025-05-07T19:45:04.9649803Z #define __unix__ 1 2025-05-07T19:45:04.9649998Z #define __x86_64 1 2025-05-07T19:45:04.9650211Z #define __x86_64__ 1 2025-05-07T19:45:04.9650418Z #define linux 1 2025-05-07T19:45:04.9650641Z #define unix 1 2025-05-07T19:45:04.9650763Z 2025-05-07T19:45:05.0096134Z 2025-05-07T19:45:05.0096626Z + conda run -n build_binary c++ --version 2025-05-07T19:45:05.0096930Z 2025-05-07T19:45:06.6097065Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:06.6098974Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:06.6099792Z Thread model: posix 2025-05-07T19:45:06.6100750Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:06.6102645Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:06.6104013Z 2025-05-07T19:45:06.6682673Z 2025-05-07T19:45:06.6683665Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:06.6685426Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:06.6686378Z 2025-05-07T19:45:08.3273591Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:08.3274117Z 2025-05-07T19:45:08.3275050Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:08.3275700Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:08.3276101Z 2025-05-07T19:45:09.9912292Z #define __cplusplus 201703L 2025-05-07T19:45:09.9912900Z 2025-05-07T19:45:09.9913345Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:09.9982486Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:09.9982920Z . 
$PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:09.9983650Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:09.9983977Z env: 2025-05-07T19:45:09.9984192Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:09.9984537Z BUILD_ENV: build_binary 2025-05-07T19:45:09.9984792Z BUILD_TARGET: default 2025-05-07T19:45:09.9985061Z BUILD_VARIANT: cuda 2025-05-07T19:45:09.9985285Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:09.9985536Z ##[endgroup] 2025-05-07T19:45:10.4118960Z ################################################################################ 2025-05-07T19:45:10.4119359Z # Install Build Tools 2025-05-07T19:45:10.4119664Z # 2025-05-07T19:45:10.4133760Z # [2025-05-07T19:45:10.413Z] + install_build_tools build_binary 2025-05-07T19:45:10.4134211Z ################################################################################ 2025-05-07T19:45:10.4134626Z 2025-05-07T19:45:10.4153118Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:10.5017322Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:10.5028503Z [INSTALL] Installing build tools ... 
2025-05-07T19:45:10.5054555Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:11.2261286Z Channels: 2025-05-07T19:45:11.2261992Z - conda-forge 2025-05-07T19:45:11.2262631Z Platform: linux-64 2025-05-07T19:45:14.3272242Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:17.9913486Z Solving environment: \ | / - done 2025-05-07T19:45:18.0535719Z 2025-05-07T19:45:18.0536466Z ## Package Plan ## 2025-05-07T19:45:18.0536931Z 2025-05-07T19:45:18.0537698Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:18.0538623Z 2025-05-07T19:45:18.0538942Z added / updated specs: 2025-05-07T19:45:18.0539684Z - auditwheel 2025-05-07T19:45:18.0540284Z - bazel 2025-05-07T19:45:18.0540926Z - cmake[version='>=3.30'] 2025-05-07T19:45:18.0541662Z - hypothesis 2025-05-07T19:45:18.0542275Z - jinja2 2025-05-07T19:45:18.0542837Z - make 2025-05-07T19:45:18.0543392Z - ncurses 2025-05-07T19:45:18.0543940Z - ninja 2025-05-07T19:45:18.0544513Z - openblas 2025-05-07T19:45:18.0545112Z - patchelf 2025-05-07T19:45:18.0545676Z - pyyaml 2025-05-07T19:45:18.0546256Z - rhash 2025-05-07T19:45:18.0546802Z - scikit-build 2025-05-07T19:45:18.0547424Z - wheel 2025-05-07T19:45:18.0547746Z 2025-05-07T19:45:18.0547760Z 2025-05-07T19:45:18.0548106Z The following packages will be downloaded: 2025-05-07T19:45:18.0548681Z 2025-05-07T19:45:18.0548806Z package | build 2025-05-07T19:45:18.0549154Z ---------------------------|----------------- 2025-05-07T19:45:18.0549597Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:18.0550082Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:18.0550650Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:18.0551119Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:18.0551650Z c-ares-1.34.5 | hb9d3cd8_0 
202 KB conda-forge 2025-05-07T19:45:18.0552075Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:18.0552468Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:18.0552893Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:18.0553328Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:18.0554374Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:18.0554983Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:18.0555581Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:18.0556174Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:18.0556753Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:18.0557241Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:18.0557772Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:18.0558298Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:18.0558807Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:18.0559262Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:18.0559747Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:18.0560245Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:18.0560810Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:18.0561252Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:18.0561645Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:18.0562077Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:18.0562504Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:18.0562885Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:18.0563407Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:18.0563845Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:18.0564321Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:18.0564729Z 
libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:18.0565182Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:18.0565716Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:18.0566375Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:18.0566828Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:18.0567289Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:18.0567788Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:18.0568267Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:18.0568772Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:18.0569261Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:18.0569695Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:18.0570177Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:18.0570630Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:18.0571114Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:18.0571614Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:18.0572173Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:18.0572695Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:18.0573170Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:18.0573708Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:18.0574194Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:18.0574667Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:18.0575090Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:18.0575536Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:18.0575954Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:18.0576393Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:18.0576822Z libwebp-base-1.5.0 | h851e524_0 420 KB 
conda-forge 2025-05-07T19:45:18.0577277Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:18.0577714Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:18.0578108Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:18.0578557Z markupsafe-3.0.2 | py311h2dc5d0c_1 25 KB conda-forge 2025-05-07T19:45:18.0578984Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:18.0579412Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:18.0579846Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:18.0580313Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:18.0580763Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:18.0581253Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:18.0581687Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:18.0582091Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:18.0582551Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:18.0583022Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:18.0583459Z python-3.11.11 |h9e4cc4f_2_cpython 29.2 MB conda-forge 2025-05-07T19:45:18.0583911Z pyyaml-6.0.2 | py311h2dc5d0c_2 208 KB conda-forge 2025-05-07T19:45:18.0584315Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:18.0584732Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:18.0585161Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:18.0585634Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:18.0586121Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:18.0586570Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:18.0586998Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:18.0587395Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:18.0587821Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:18.0588273Z xorg-libice-1.1.2 | 
hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:18.0588705Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:18.0589173Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:18.0589621Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:18.0590097Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:18.0590634Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:18.0591120Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:18.0591591Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:18.0592036Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:18.0592544Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:18.0592997Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:18.0593461Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:18.0593978Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:18.0594625Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:18.0595146Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:18.0595580Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:18.0596033Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:18.0596454Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:18.0596896Z ------------------------------------------------------------ 2025-05-07T19:45:18.0597293Z Total: 336.5 MB 2025-05-07T19:45:18.0597522Z 2025-05-07T19:45:18.0597663Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:18.0597901Z 2025-05-07T19:45:18.0598147Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:18.0598612Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:18.0599234Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:18.0599754Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:18.0600200Z c-ares 
conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:18.0600804Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:18.0601248Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:18.0601707Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:18.0602177Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:18.0602822Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:18.0603451Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:18.0604073Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:18.0604720Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:18.0605339Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:18.0605856Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:18.0606393Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:18.0606900Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:18.0607404Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:18.0607875Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:18.0608337Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:18.0608846Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:18.0609332Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:18.0609829Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:18.0610270Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:18.0610794Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:18.0611259Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:18.0611666Z lcms2 
conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:18.0612100Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:18.0612579Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:18.0613105Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:18.0613565Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:18.0614023Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:18.0614546Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:18.0615007Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:18.0615463Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:18.0615956Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:18.0616462Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:18.0616992Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:18.0617489Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:18.0617989Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:18.0618457Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:18.0618939Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:18.0619498Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:18.0619983Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:18.0620518Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:18.0621015Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:18.0621511Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:18.0622037Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:18.0622492Z libprotobuf 
conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:18.0623004Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:18.0623504Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:18.0623958Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:18.0624418Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:18.0624847Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:18.0625342Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:18.0625841Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:18.0626254Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:18.0626917Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py311h2dc5d0c_1 2025-05-07T19:45:18.0627409Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:18.0628104Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:18.0629030Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:18.0629540Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:18.0630074Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:18.0630543Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:18.0631162Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:18.0631683Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:18.0632263Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:18.0632782Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py311h2dc5d0c_2 2025-05-07T19:45:18.0633241Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:18.0633704Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:18.0634297Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:18.0634898Z 
singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:18.0635492Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:18.0636040Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:18.0636561Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:18.0637078Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:18.0637615Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:18.0638164Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:18.0638709Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:18.0639284Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:18.0639839Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:18.0640402Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:18.0640959Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:18.0641678Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:18.0642270Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:18.0642790Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:18.0643356Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:18.0644067Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:18.0644511Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:18.0644804Z 2025-05-07T19:45:18.0644933Z The following packages will be UPDATED: 2025-05-07T19:45:18.0645154Z 2025-05-07T19:45:18.0645453Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:18.0646047Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:18.0646743Z ncurses 
pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:18.0647433Z python pkgs/main::python-3.11.11-he870216_0 --> conda-forge::python-3.11.11-h9e4cc4f_2_cpython 2025-05-07T19:45:18.0648147Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:18.0648852Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:18.0649475Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:18.0649966Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:18.0650367Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:18.0650644Z 2025-05-07T19:45:18.0650878Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:18.0651217Z 2025-05-07T19:45:18.0651483Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:18.0651827Z 2025-05-07T19:45:18.0651857Z 2025-05-07T19:45:18.0651933Z 2025-05-07T19:45:18.0652094Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:18.0652524Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:18.0652769Z 2025-05-07T19:45:18.0653104Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:18.0653347Z 2025-05-07T19:45:18.0653351Z 2025-05-07T19:45:18.0653572Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:18.0653856Z 2025-05-07T19:45:18.0653859Z 2025-05-07T19:45:18.0653863Z 2025-05-07T19:45:18.0662223Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:18.0662969Z 2025-05-07T19:45:18.0663000Z 2025-05-07T19:45:18.0663011Z 2025-05-07T19:45:18.0663022Z 2025-05-07T19:45:18.0663713Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:18.0664504Z 2025-05-07T19:45:18.0664515Z 2025-05-07T19:45:18.0664526Z 2025-05-07T19:45:18.0664537Z 2025-05-07T19:45:18.0664563Z 2025-05-07T19:45:18.0665304Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:18.0666099Z 2025-05-07T19:45:18.0666110Z 2025-05-07T19:45:18.0666121Z 2025-05-07T19:45:18.0666132Z 2025-05-07T19:45:18.0666142Z 2025-05-07T19:45:18.0666171Z 2025-05-07T19:45:18.0666914Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:18.0667791Z 2025-05-07T19:45:18.0667802Z 2025-05-07T19:45:18.0667812Z 2025-05-07T19:45:18.0667822Z 2025-05-07T19:45:18.0667833Z 2025-05-07T19:45:18.0667843Z 2025-05-07T19:45:18.0667854Z 2025-05-07T19:45:18.0668553Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:18.0669475Z 2025-05-07T19:45:18.0669478Z 2025-05-07T19:45:18.0669482Z 2025-05-07T19:45:18.0669485Z 2025-05-07T19:45:18.0669570Z 2025-05-07T19:45:18.0669573Z 2025-05-07T19:45:18.0669577Z 2025-05-07T19:45:18.0669580Z 2025-05-07T19:45:18.0669844Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:18.0670161Z 2025-05-07T19:45:18.0670165Z 2025-05-07T19:45:18.0670169Z 2025-05-07T19:45:18.0670172Z 2025-05-07T19:45:18.0670176Z 2025-05-07T19:45:18.0670179Z 2025-05-07T19:45:18.0670183Z 2025-05-07T19:45:18.0670187Z 2025-05-07T19:45:18.0670190Z 2025-05-07T19:45:18.0670475Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:18.0670812Z 2025-05-07T19:45:18.0670815Z 
2025-05-07T19:45:18.0670818Z 2025-05-07T19:45:18.0670822Z 2025-05-07T19:45:18.0670825Z 2025-05-07T19:45:18.0670829Z 2025-05-07T19:45:18.0670832Z 2025-05-07T19:45:18.0670836Z 2025-05-07T19:45:18.0670839Z 2025-05-07T19:45:18.0670843Z 2025-05-07T19:45:18.0671238Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:18.0671541Z 2025-05-07T19:45:18.0671554Z 2025-05-07T19:45:18.0671558Z 2025-05-07T19:45:18.0671561Z 2025-05-07T19:45:18.0671565Z 2025-05-07T19:45:18.0671569Z 2025-05-07T19:45:18.0671572Z 2025-05-07T19:45:18.0671575Z 2025-05-07T19:45:18.0671583Z 2025-05-07T19:45:18.0671587Z 2025-05-07T19:45:18.0671590Z 2025-05-07T19:45:18.0672749Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:18.0673073Z 2025-05-07T19:45:18.0673088Z 2025-05-07T19:45:18.0673091Z 2025-05-07T19:45:18.0673095Z 2025-05-07T19:45:18.0673098Z 2025-05-07T19:45:18.0673102Z 2025-05-07T19:45:18.0673105Z 2025-05-07T19:45:18.0673109Z 2025-05-07T19:45:18.0673112Z 2025-05-07T19:45:18.0673115Z 2025-05-07T19:45:18.0673119Z 2025-05-07T19:45:18.0673123Z 2025-05-07T19:45:18.0674248Z harfbuzz-9.0.0 | 1.5 MB | | 0%  2025-05-07T19:45:18.0674567Z 2025-05-07T19:45:18.0674571Z 2025-05-07T19:45:18.0674574Z 2025-05-07T19:45:18.0674584Z 2025-05-07T19:45:18.0674588Z 2025-05-07T19:45:18.0674591Z 2025-05-07T19:45:18.0674595Z 2025-05-07T19:45:18.0674599Z 2025-05-07T19:45:18.0674603Z 2025-05-07T19:45:18.0674634Z 2025-05-07T19:45:18.0674638Z 2025-05-07T19:45:18.0674870Z 2025-05-07T19:45:18.0674874Z 2025-05-07T19:45:18.0675366Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:18.0675701Z 2025-05-07T19:45:18.0675705Z 2025-05-07T19:45:18.0675708Z 2025-05-07T19:45:18.0675740Z 2025-05-07T19:45:18.0675744Z 2025-05-07T19:45:18.0675747Z 2025-05-07T19:45:18.0675751Z 2025-05-07T19:45:18.0675754Z 2025-05-07T19:45:18.0675758Z 2025-05-07T19:45:18.0675761Z 2025-05-07T19:45:18.0675765Z 2025-05-07T19:45:18.0675768Z 2025-05-07T19:45:18.0675771Z 2025-05-07T19:45:18.0675775Z 2025-05-07T19:45:18.0676555Z krb5-1.21.3 | 1.3 MB | | 0%  
2025-05-07T19:45:18.0676889Z 2025-05-07T19:45:18.0676893Z 2025-05-07T19:45:18.0676902Z 2025-05-07T19:45:18.0676905Z 2025-05-07T19:45:18.0676910Z 2025-05-07T19:45:18.0676913Z 2025-05-07T19:45:18.0676917Z 2025-05-07T19:45:18.0676920Z 2025-05-07T19:45:18.0676924Z 2025-05-07T19:45:18.0676927Z 2025-05-07T19:45:18.0676933Z 2025-05-07T19:45:18.0676937Z 2025-05-07T19:45:18.0676940Z 2025-05-07T19:45:18.0676944Z 2025-05-07T19:45:18.0676947Z 2025-05-07T19:45:18.0678104Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:18.0678446Z 2025-05-07T19:45:18.0678464Z 2025-05-07T19:45:18.0678467Z 2025-05-07T19:45:18.0678470Z 2025-05-07T19:45:18.0678474Z 2025-05-07T19:45:18.0678478Z 2025-05-07T19:45:18.0678481Z 2025-05-07T19:45:18.0678485Z 2025-05-07T19:45:18.0678488Z 2025-05-07T19:45:18.0678492Z 2025-05-07T19:45:18.0678495Z 2025-05-07T19:45:18.0678499Z 2025-05-07T19:45:18.0678502Z 2025-05-07T19:45:18.0678533Z 2025-05-07T19:45:18.0678537Z 2025-05-07T19:45:18.0678540Z 2025-05-07T19:45:18.0679105Z cairo-1.18.0 | 961 KB | | 0%  2025-05-07T19:45:18.0679494Z 2025-05-07T19:45:18.0679511Z 2025-05-07T19:45:18.0679515Z 2025-05-07T19:45:18.0679519Z 2025-05-07T19:45:18.0679555Z 2025-05-07T19:45:18.0679560Z 2025-05-07T19:45:18.0679564Z 2025-05-07T19:45:18.0679567Z 2025-05-07T19:45:18.0679570Z 2025-05-07T19:45:18.0679574Z 2025-05-07T19:45:18.0679577Z 2025-05-07T19:45:18.0679581Z 2025-05-07T19:45:18.0679585Z 2025-05-07T19:45:18.0679588Z 2025-05-07T19:45:18.0679592Z 2025-05-07T19:45:18.0679595Z 2025-05-07T19:45:18.0679599Z 2025-05-07T19:45:18.0680410Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:18.0680744Z 2025-05-07T19:45:18.0680759Z 2025-05-07T19:45:18.0680763Z 2025-05-07T19:45:18.0680766Z 2025-05-07T19:45:18.0680770Z 2025-05-07T19:45:18.0680773Z 2025-05-07T19:45:18.0680777Z 2025-05-07T19:45:18.0680781Z 2025-05-07T19:45:18.0680784Z 2025-05-07T19:45:18.0680793Z 2025-05-07T19:45:18.0680796Z 2025-05-07T19:45:18.0680799Z 2025-05-07T19:45:18.0680803Z 2025-05-07T19:45:18.0680807Z 
2025-05-07T19:45:18.0680810Z 2025-05-07T19:45:18.0680814Z 2025-05-07T19:45:18.0680817Z 2025-05-07T19:45:18.0680850Z 2025-05-07T19:45:18.0681767Z libsqlite-3.49.2 | 895 KB | | 0%  2025-05-07T19:45:18.0682106Z 2025-05-07T19:45:18.0682110Z 2025-05-07T19:45:18.0682114Z 2025-05-07T19:45:18.0682118Z 2025-05-07T19:45:18.0682122Z 2025-05-07T19:45:18.0682153Z 2025-05-07T19:45:18.0682157Z 2025-05-07T19:45:18.0682160Z 2025-05-07T19:45:18.0682164Z 2025-05-07T19:45:18.0682167Z 2025-05-07T19:45:18.0682171Z 2025-05-07T19:45:18.0682174Z 2025-05-07T19:45:18.0682178Z 2025-05-07T19:45:18.0682182Z 2025-05-07T19:45:18.0682185Z 2025-05-07T19:45:18.0682189Z 2025-05-07T19:45:18.0682192Z 2025-05-07T19:45:18.0682195Z 2025-05-07T19:45:18.0682199Z 2025-05-07T19:45:18.3144726Z ... (more hidden) ... 2025-05-07T19:45:18.3145198Z 2025-05-07T19:45:18.3145203Z 2025-05-07T19:45:18.3145207Z 2025-05-07T19:45:18.3145210Z 2025-05-07T19:45:18.4151335Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:18.4152239Z 2025-05-07T19:45:18.4152255Z 2025-05-07T19:45:18.4152267Z 2025-05-07T19:45:18.4152277Z 2025-05-07T19:45:18.4180398Z libgrpc-1.71.0 | 7.6 MB | 7 | 7%  2025-05-07T19:45:18.4181310Z 2025-05-07T19:45:18.4181326Z 2025-05-07T19:45:18.4221455Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:18.4283396Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:18.4283723Z 2025-05-07T19:45:18.4285156Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:18.4285418Z 2025-05-07T19:45:18.4285430Z 2025-05-07T19:45:18.4285434Z 2025-05-07T19:45:18.5150433Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:18.5151323Z 2025-05-07T19:45:18.5151339Z 2025-05-07T19:45:18.5151351Z 2025-05-07T19:45:18.5151361Z 2025-05-07T19:45:18.5181246Z libgrpc-1.71.0 | 7.6 MB | #####6 | 56%  2025-05-07T19:45:18.5182232Z 2025-05-07T19:45:18.5182276Z 2025-05-07T19:45:18.5222217Z python-3.11.11 | 29.2 MB | ##6 | 26%  2025-05-07T19:45:18.5283960Z openjdk-23.0.1 | 181.3 MB | 4 | 4% 2025-05-07T19:45:18.5284331Z 2025-05-07T19:45:18.5285867Z bazel-7.5.0 | 
47.4 MB | #6 | 17%  2025-05-07T19:45:18.5286117Z 2025-05-07T19:45:18.5286128Z 2025-05-07T19:45:18.5286607Z 2025-05-07T19:45:18.6143870Z cmake-4.0.2 | 19.4 MB | ####4 | 45%  2025-05-07T19:45:18.6144209Z 2025-05-07T19:45:18.6144215Z 2025-05-07T19:45:18.6144220Z 2025-05-07T19:45:18.6144226Z 2025-05-07T19:45:18.6184319Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:18.6184689Z 2025-05-07T19:45:18.6184695Z 2025-05-07T19:45:18.6224024Z python-3.11.11 | 29.2 MB | ######5 | 66%  2025-05-07T19:45:18.6285887Z openjdk-23.0.1 | 181.3 MB | 7 | 7% 2025-05-07T19:45:18.6286353Z 2025-05-07T19:45:18.6286359Z 2025-05-07T19:45:18.6286476Z 2025-05-07T19:45:18.6290809Z cmake-4.0.2 | 19.4 MB | ########4 | 85%  2025-05-07T19:45:18.6292981Z 2025-05-07T19:45:18.6646489Z bazel-7.5.0 | 47.4 MB | ###6 | 36%  2025-05-07T19:45:18.6647087Z 2025-05-07T19:45:18.6647092Z 2025-05-07T19:45:18.6647096Z 2025-05-07T19:45:18.6647099Z 2025-05-07T19:45:18.6647103Z 2025-05-07T19:45:18.7225334Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:18.7356838Z openjdk-23.0.1 | 181.3 MB | #1 | 12% 2025-05-07T19:45:18.7357628Z 2025-05-07T19:45:18.7379892Z bazel-7.5.0 | 47.4 MB | ##### | 51%  2025-05-07T19:45:18.7380674Z 2025-05-07T19:45:18.7380688Z 2025-05-07T19:45:18.8029536Z python-3.11.11 | 29.2 MB | #########2 | 92%  2025-05-07T19:45:18.8030477Z 2025-05-07T19:45:18.8030491Z 2025-05-07T19:45:18.8030503Z 2025-05-07T19:45:18.8030513Z 2025-05-07T19:45:18.8030523Z 2025-05-07T19:45:18.8031294Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:18.8032147Z 2025-05-07T19:45:18.8032157Z 2025-05-07T19:45:18.8032168Z 2025-05-07T19:45:18.8032178Z 2025-05-07T19:45:18.8032188Z 2025-05-07T19:45:18.8287549Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:18.8357110Z openjdk-23.0.1 | 181.3 MB | #6 | 16% 2025-05-07T19:45:18.8357899Z 2025-05-07T19:45:18.8411667Z bazel-7.5.0 | 47.4 MB | #######1 | 72%  2025-05-07T19:45:18.8412504Z 2025-05-07T19:45:18.8412519Z 2025-05-07T19:45:18.8412530Z 
2025-05-07T19:45:18.8412540Z 2025-05-07T19:45:18.8412552Z 2025-05-07T19:45:18.8412563Z 2025-05-07T19:45:18.8861935Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:18.8862923Z 2025-05-07T19:45:18.8862982Z 2025-05-07T19:45:18.8862994Z 2025-05-07T19:45:18.9245180Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:18.9245492Z 2025-05-07T19:45:18.9245497Z 2025-05-07T19:45:18.9245807Z 2025-05-07T19:45:18.9245814Z 2025-05-07T19:45:18.9245844Z 2025-05-07T19:45:18.9245849Z 2025-05-07T19:45:18.9245853Z 2025-05-07T19:45:18.9291333Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:18.9476477Z openjdk-23.0.1 | 181.3 MB | ##1 | 22% 2025-05-07T19:45:18.9476758Z 2025-05-07T19:45:18.9584577Z bazel-7.5.0 | 47.4 MB | ########8 | 88%  2025-05-07T19:45:18.9584914Z 2025-05-07T19:45:18.9584920Z 2025-05-07T19:45:18.9584933Z 2025-05-07T19:45:18.9584938Z 2025-05-07T19:45:18.9584943Z 2025-05-07T19:45:18.9584949Z 2025-05-07T19:45:19.0345433Z libopenblas-0.3.29 | 5.6 MB | ######1 | 61%  2025-05-07T19:45:19.0426540Z openjdk-23.0.1 | 181.3 MB | ##6 | 26% 2025-05-07T19:45:19.0427438Z 2025-05-07T19:45:19.0427453Z 2025-05-07T19:45:19.0427463Z 2025-05-07T19:45:19.0427474Z 2025-05-07T19:45:19.0427484Z 2025-05-07T19:45:19.0427495Z 2025-05-07T19:45:19.0427505Z 2025-05-07T19:45:19.0429187Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:19.0429525Z 2025-05-07T19:45:19.0429532Z 2025-05-07T19:45:19.0429538Z 2025-05-07T19:45:19.0429543Z 2025-05-07T19:45:19.0429547Z 2025-05-07T19:45:19.0429551Z 2025-05-07T19:45:19.0429554Z 2025-05-07T19:45:19.0782833Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:19.0783225Z 2025-05-07T19:45:19.0783232Z 2025-05-07T19:45:19.0783242Z 2025-05-07T19:45:19.0783248Z 2025-05-07T19:45:19.0783253Z 2025-05-07T19:45:19.0783258Z 2025-05-07T19:45:19.0783541Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:19.0783839Z 2025-05-07T19:45:19.0783844Z 2025-05-07T19:45:19.0783849Z 2025-05-07T19:45:19.0784206Z 
2025-05-07T19:45:19.0784213Z 2025-05-07T19:45:19.0784217Z 2025-05-07T19:45:19.0804953Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:19.0805920Z 2025-05-07T19:45:19.0805935Z 2025-05-07T19:45:19.0805996Z 2025-05-07T19:45:19.0806008Z 2025-05-07T19:45:19.0806018Z 2025-05-07T19:45:19.0806060Z 2025-05-07T19:45:19.0806071Z 2025-05-07T19:45:19.0806081Z 2025-05-07T19:45:19.1101432Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:19.1102325Z 2025-05-07T19:45:19.1102338Z 2025-05-07T19:45:19.1187401Z python-3.11.11 | 29.2 MB | ########## | 100%  2025-05-07T19:45:19.1187711Z 2025-05-07T19:45:19.1187870Z 2025-05-07T19:45:19.1187874Z 2025-05-07T19:45:19.1188126Z 2025-05-07T19:45:19.1188148Z 2025-05-07T19:45:19.1188157Z 2025-05-07T19:45:19.1188165Z 2025-05-07T19:45:19.1188173Z 2025-05-07T19:45:19.1188181Z 2025-05-07T19:45:19.1346018Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:19.1626264Z openjdk-23.0.1 | 181.3 MB | ###1 | 31% 2025-05-07T19:45:19.1626597Z 2025-05-07T19:45:19.1626605Z 2025-05-07T19:45:19.1626611Z 2025-05-07T19:45:19.1626616Z 2025-05-07T19:45:19.1626663Z 2025-05-07T19:45:19.1626667Z 2025-05-07T19:45:19.1626671Z 2025-05-07T19:45:19.1626674Z 2025-05-07T19:45:19.1626678Z 2025-05-07T19:45:19.1626681Z 2025-05-07T19:45:19.1660471Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:19.1660802Z 2025-05-07T19:45:19.1660877Z 2025-05-07T19:45:19.1660881Z 2025-05-07T19:45:19.1660888Z 2025-05-07T19:45:19.1660892Z 2025-05-07T19:45:19.1660994Z 2025-05-07T19:45:19.1661008Z 2025-05-07T19:45:19.1661015Z 2025-05-07T19:45:19.2022006Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:19.2022365Z 2025-05-07T19:45:19.2022467Z 2025-05-07T19:45:19.2022474Z 2025-05-07T19:45:19.2022589Z 2025-05-07T19:45:19.2022600Z 2025-05-07T19:45:19.2022640Z 2025-05-07T19:45:19.2022647Z 2025-05-07T19:45:19.2022653Z 2025-05-07T19:45:19.2022705Z 2025-05-07T19:45:19.2055239Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:19.2055883Z 
2025-05-07T19:45:19.2055907Z 2025-05-07T19:45:19.2055912Z 2025-05-07T19:45:19.2055915Z 2025-05-07T19:45:19.2055919Z 2025-05-07T19:45:19.2055951Z 2025-05-07T19:45:19.2055955Z 2025-05-07T19:45:19.2055958Z 2025-05-07T19:45:19.2055962Z 2025-05-07T19:45:19.2055965Z 2025-05-07T19:45:19.2055969Z 2025-05-07T19:45:19.2421047Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:19.2421585Z 2025-05-07T19:45:19.2421682Z 2025-05-07T19:45:19.2421689Z 2025-05-07T19:45:19.2421769Z 2025-05-07T19:45:19.2421777Z 2025-05-07T19:45:19.2421816Z 2025-05-07T19:45:19.2421821Z 2025-05-07T19:45:19.2421846Z 2025-05-07T19:45:19.2421850Z 2025-05-07T19:45:19.2421890Z 2025-05-07T19:45:19.2421893Z 2025-05-07T19:45:19.2421985Z 2025-05-07T19:45:19.2470889Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:19.2507980Z openjdk-23.0.1 | 181.3 MB | ###5 | 35% 2025-05-07T19:45:19.2508398Z 2025-05-07T19:45:19.2508546Z 2025-05-07T19:45:19.2508550Z 2025-05-07T19:45:19.2508615Z 2025-05-07T19:45:19.2508619Z 2025-05-07T19:45:19.2509072Z 2025-05-07T19:45:19.2509172Z 2025-05-07T19:45:19.2509178Z 2025-05-07T19:45:19.2509206Z 2025-05-07T19:45:19.2509210Z 2025-05-07T19:45:19.2509214Z 2025-05-07T19:45:19.2642417Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:19.2642817Z 2025-05-07T19:45:19.2642822Z 2025-05-07T19:45:19.2642826Z 2025-05-07T19:45:19.2642829Z 2025-05-07T19:45:19.2642833Z 2025-05-07T19:45:19.2642837Z 2025-05-07T19:45:19.2642840Z 2025-05-07T19:45:19.2642844Z 2025-05-07T19:45:19.2642848Z 2025-05-07T19:45:19.2642882Z 2025-05-07T19:45:19.2643297Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:19.2643821Z 2025-05-07T19:45:19.2643836Z 2025-05-07T19:45:19.2643840Z 2025-05-07T19:45:19.2643843Z 2025-05-07T19:45:19.2643847Z 2025-05-07T19:45:19.2643850Z 2025-05-07T19:45:19.2643861Z 2025-05-07T19:45:19.2643864Z 2025-05-07T19:45:19.2643868Z 2025-05-07T19:45:19.2643871Z 2025-05-07T19:45:19.2740910Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:19.2741750Z 
2025-05-07T19:45:19.2741764Z 2025-05-07T19:45:19.2741776Z 2025-05-07T19:45:19.2741786Z 2025-05-07T19:45:19.2742763Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:19.2743581Z 2025-05-07T19:45:19.2743593Z 2025-05-07T19:45:19.2743603Z 2025-05-07T19:45:19.2743624Z 2025-05-07T19:45:19.2827032Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:19.2827932Z 2025-05-07T19:45:19.2827946Z 2025-05-07T19:45:19.2827958Z 2025-05-07T19:45:19.2827968Z 2025-05-07T19:45:19.2828011Z 2025-05-07T19:45:19.2828021Z 2025-05-07T19:45:19.2828032Z 2025-05-07T19:45:19.2828042Z 2025-05-07T19:45:19.2828053Z 2025-05-07T19:45:19.2828063Z 2025-05-07T19:45:19.2828074Z 2025-05-07T19:45:19.2828084Z 2025-05-07T19:45:19.2911412Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:19.2912397Z 2025-05-07T19:45:19.2912411Z 2025-05-07T19:45:19.2912423Z 2025-05-07T19:45:19.2912433Z 2025-05-07T19:45:19.2912444Z 2025-05-07T19:45:19.2912455Z 2025-05-07T19:45:19.2912465Z 2025-05-07T19:45:19.2912475Z 2025-05-07T19:45:19.2912486Z 2025-05-07T19:45:19.2912496Z 2025-05-07T19:45:19.2912506Z 2025-05-07T19:45:19.2912517Z 2025-05-07T19:45:19.2912527Z 2025-05-07T19:45:19.3150232Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:19.3151249Z 2025-05-07T19:45:19.3151263Z 2025-05-07T19:45:19.3151275Z 2025-05-07T19:45:19.3151285Z 2025-05-07T19:45:19.3151296Z 2025-05-07T19:45:19.3151306Z 2025-05-07T19:45:19.3151349Z 2025-05-07T19:45:19.3151359Z 2025-05-07T19:45:19.3151369Z 2025-05-07T19:45:19.3151380Z 2025-05-07T19:45:19.3151390Z 2025-05-07T19:45:19.3151401Z 2025-05-07T19:45:19.3151440Z 2025-05-07T19:45:19.3151852Z 2025-05-07T19:45:19.3151867Z 2025-05-07T19:45:19.3276446Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:19.3276831Z 2025-05-07T19:45:19.3276836Z 2025-05-07T19:45:19.3276840Z 2025-05-07T19:45:19.3276844Z 2025-05-07T19:45:19.3276847Z 2025-05-07T19:45:19.3276879Z 2025-05-07T19:45:19.3276883Z 2025-05-07T19:45:19.3276887Z 2025-05-07T19:45:19.3276890Z 
2025-05-07T19:45:19.3276894Z 2025-05-07T19:45:19.3276897Z 2025-05-07T19:45:19.3276900Z 2025-05-07T19:45:19.3276904Z 2025-05-07T19:45:19.3284908Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:19.3285326Z 2025-05-07T19:45:19.3285331Z 2025-05-07T19:45:19.3285335Z 2025-05-07T19:45:19.3285357Z 2025-05-07T19:45:19.3285360Z 2025-05-07T19:45:19.3285364Z 2025-05-07T19:45:19.3285367Z 2025-05-07T19:45:19.3285371Z 2025-05-07T19:45:19.3285374Z 2025-05-07T19:45:19.3285378Z 2025-05-07T19:45:19.3285381Z 2025-05-07T19:45:19.3285397Z 2025-05-07T19:45:19.3285401Z 2025-05-07T19:45:19.3285404Z 2025-05-07T19:45:19.3471463Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:19.3564664Z openjdk-23.0.1 | 181.3 MB | #### | 40% 2025-05-07T19:45:19.3564957Z 2025-05-07T19:45:19.3564961Z 2025-05-07T19:45:19.3564965Z 2025-05-07T19:45:19.3564969Z 2025-05-07T19:45:19.3564973Z 2025-05-07T19:45:19.3564976Z 2025-05-07T19:45:19.3564980Z 2025-05-07T19:45:19.3564983Z 2025-05-07T19:45:19.3564987Z 2025-05-07T19:45:19.3564990Z 2025-05-07T19:45:19.3564994Z 2025-05-07T19:45:19.3565015Z 2025-05-07T19:45:19.3565018Z 2025-05-07T19:45:19.3565022Z 2025-05-07T19:45:19.3565025Z 2025-05-07T19:45:19.3657319Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:19.3658855Z 2025-05-07T19:45:19.3658870Z 2025-05-07T19:45:19.3658881Z 2025-05-07T19:45:19.3658891Z 2025-05-07T19:45:19.3658902Z 2025-05-07T19:45:19.3658928Z 2025-05-07T19:45:19.3658939Z 2025-05-07T19:45:19.3658950Z 2025-05-07T19:45:19.3658960Z 2025-05-07T19:45:19.3658970Z 2025-05-07T19:45:19.3658980Z 2025-05-07T19:45:19.3658990Z 2025-05-07T19:45:19.3659000Z 2025-05-07T19:45:19.3659011Z 2025-05-07T19:45:19.3659022Z 2025-05-07T19:45:19.3659032Z 2025-05-07T19:45:19.3727188Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:19.3728125Z 2025-05-07T19:45:19.3728171Z 2025-05-07T19:45:19.3728182Z 2025-05-07T19:45:19.3728192Z 2025-05-07T19:45:19.3728203Z 2025-05-07T19:45:19.3728213Z 2025-05-07T19:45:19.3728224Z 
2025-05-07T19:45:19.3728234Z 2025-05-07T19:45:19.3728244Z 2025-05-07T19:45:19.3728255Z 2025-05-07T19:45:19.3728265Z 2025-05-07T19:45:19.3728682Z 2025-05-07T19:45:19.3728730Z 2025-05-07T19:45:19.3728752Z 2025-05-07T19:45:19.3902583Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:19.3903530Z 2025-05-07T19:45:19.3903575Z 2025-05-07T19:45:19.3903616Z 2025-05-07T19:45:19.3903628Z 2025-05-07T19:45:19.3903638Z 2025-05-07T19:45:19.3903648Z 2025-05-07T19:45:19.3903659Z 2025-05-07T19:45:19.3903669Z 2025-05-07T19:45:19.3903679Z 2025-05-07T19:45:19.3903690Z 2025-05-07T19:45:19.3903700Z 2025-05-07T19:45:19.3903710Z 2025-05-07T19:45:19.3903721Z 2025-05-07T19:45:19.3903731Z 2025-05-07T19:45:19.3903742Z 2025-05-07T19:45:19.3903752Z 2025-05-07T19:45:19.3904606Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:19.3905535Z 2025-05-07T19:45:19.3905546Z 2025-05-07T19:45:19.3905556Z 2025-05-07T19:45:19.3905567Z 2025-05-07T19:45:19.3905577Z 2025-05-07T19:45:19.3905587Z 2025-05-07T19:45:19.3905597Z 2025-05-07T19:45:19.3905623Z 2025-05-07T19:45:19.3905634Z 2025-05-07T19:45:19.3905644Z 2025-05-07T19:45:19.3905654Z 2025-05-07T19:45:19.3905665Z 2025-05-07T19:45:19.3905675Z 2025-05-07T19:45:19.3905685Z 2025-05-07T19:45:19.3906081Z 2025-05-07T19:45:19.3906095Z 2025-05-07T19:45:19.3906106Z 2025-05-07T19:45:19.4104112Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:19.4104716Z 2025-05-07T19:45:19.4104720Z 2025-05-07T19:45:19.4104724Z 2025-05-07T19:45:19.4104727Z 2025-05-07T19:45:19.4104731Z 2025-05-07T19:45:19.4104735Z 2025-05-07T19:45:19.4104738Z 2025-05-07T19:45:19.4104741Z 2025-05-07T19:45:19.4104745Z 2025-05-07T19:45:19.4104772Z 2025-05-07T19:45:19.4104775Z 2025-05-07T19:45:19.4104779Z 2025-05-07T19:45:19.4104782Z 2025-05-07T19:45:19.4104786Z 2025-05-07T19:45:19.4104789Z 2025-05-07T19:45:19.4104792Z 2025-05-07T19:45:19.4104796Z 2025-05-07T19:45:19.4233900Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:19.4234564Z 2025-05-07T19:45:19.4234644Z 
2025-05-07T19:45:19.4234688Z 2025-05-07T19:45:19.4234711Z 2025-05-07T19:45:19.4234715Z 2025-05-07T19:45:19.4234735Z 2025-05-07T19:45:19.4234739Z 2025-05-07T19:45:19.4234743Z 2025-05-07T19:45:19.4234747Z 2025-05-07T19:45:19.4234751Z 2025-05-07T19:45:19.4234761Z 2025-05-07T19:45:19.4234765Z 2025-05-07T19:45:19.4234934Z 2025-05-07T19:45:19.4234940Z 2025-05-07T19:45:19.4234944Z 2025-05-07T19:45:19.4234948Z 2025-05-07T19:45:19.4234952Z 2025-05-07T19:45:19.4234955Z 2025-05-07T19:45:19.4286129Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:19.4286563Z 2025-05-07T19:45:19.4286569Z 2025-05-07T19:45:19.4286573Z 2025-05-07T19:45:19.4286577Z 2025-05-07T19:45:19.4286581Z 2025-05-07T19:45:19.4286585Z 2025-05-07T19:45:19.4286589Z 2025-05-07T19:45:19.4286594Z 2025-05-07T19:45:19.4286598Z 2025-05-07T19:45:19.4287340Z 2025-05-07T19:45:19.4287345Z 2025-05-07T19:45:19.4287348Z 2025-05-07T19:45:19.4287352Z 2025-05-07T19:45:19.4287355Z 2025-05-07T19:45:19.4287359Z 2025-05-07T19:45:19.4287362Z 2025-05-07T19:45:19.4287366Z 2025-05-07T19:45:19.4287377Z 2025-05-07T19:45:19.4287392Z 2025-05-07T19:45:19.4560311Z ... (more hidden) ... 
2025-05-07T19:45:19.4560634Z 2025-05-07T19:45:19.4560639Z 2025-05-07T19:45:19.4560642Z 2025-05-07T19:45:19.4560646Z 2025-05-07T19:45:19.4560650Z 2025-05-07T19:45:19.4560653Z 2025-05-07T19:45:19.4560656Z 2025-05-07T19:45:19.4560660Z 2025-05-07T19:45:19.4560663Z 2025-05-07T19:45:19.4560667Z 2025-05-07T19:45:19.4560670Z 2025-05-07T19:45:19.4560674Z 2025-05-07T19:45:19.4560677Z 2025-05-07T19:45:19.4560706Z 2025-05-07T19:45:19.4560719Z 2025-05-07T19:45:19.4560722Z 2025-05-07T19:45:19.4560726Z 2025-05-07T19:45:19.4560729Z 2025-05-07T19:45:19.4566941Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:19.4567301Z 2025-05-07T19:45:19.4567342Z 2025-05-07T19:45:19.4567346Z 2025-05-07T19:45:19.4567349Z 2025-05-07T19:45:19.4567353Z 2025-05-07T19:45:19.4567356Z 2025-05-07T19:45:19.4567367Z 2025-05-07T19:45:19.4567370Z 2025-05-07T19:45:19.4567374Z 2025-05-07T19:45:19.4567377Z 2025-05-07T19:45:19.4567381Z 2025-05-07T19:45:19.4567384Z 2025-05-07T19:45:19.4567388Z 2025-05-07T19:45:19.4567391Z 2025-05-07T19:45:19.4567394Z 2025-05-07T19:45:19.4567398Z 2025-05-07T19:45:19.4567401Z 2025-05-07T19:45:19.4567405Z 2025-05-07T19:45:19.4567408Z 2025-05-07T19:45:19.4634305Z ... (more hidden) ... 
2025-05-07T19:45:19.4679699Z openjdk-23.0.1 | 181.3 MB | ####4 | 45% 2025-05-07T19:45:19.4680526Z 2025-05-07T19:45:19.4680541Z 2025-05-07T19:45:19.4680553Z 2025-05-07T19:45:19.4680563Z 2025-05-07T19:45:19.4680574Z 2025-05-07T19:45:19.4680584Z 2025-05-07T19:45:19.4680594Z 2025-05-07T19:45:19.4834449Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:19.4835374Z 2025-05-07T19:45:19.4835387Z 2025-05-07T19:45:19.4835398Z 2025-05-07T19:45:19.4835409Z 2025-05-07T19:45:19.4835811Z 2025-05-07T19:45:19.5636625Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:19.6374095Z openjdk-23.0.1 | 181.3 MB | ####9 | 49% 2025-05-07T19:45:19.6374882Z 2025-05-07T19:45:19.6637055Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:19.8015629Z openjdk-23.0.1 | 181.3 MB | #####4 | 54% 2025-05-07T19:45:19.8016469Z 2025-05-07T19:45:19.8016484Z 2025-05-07T19:45:19.8016496Z 2025-05-07T19:45:19.8016507Z 2025-05-07T19:45:19.8016517Z 2025-05-07T19:45:19.8016529Z 2025-05-07T19:45:19.8098503Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:19.9376847Z openjdk-23.0.1 | 181.3 MB | #####8 | 59% 2025-05-07T19:45:20.0492681Z openjdk-23.0.1 | 181.3 MB | ######2 | 63% 2025-05-07T19:45:20.1494259Z openjdk-23.0.1 | 181.3 MB | ######6 | 67% 2025-05-07T19:45:20.1688303Z openjdk-23.0.1 | 181.3 MB | ####### | 71% 2025-05-07T19:45:20.1688847Z 2025-05-07T19:45:20.1688852Z 2025-05-07T19:45:20.1688856Z 2025-05-07T19:45:20.1688860Z 2025-05-07T19:45:20.1688863Z 2025-05-07T19:45:20.1688867Z 2025-05-07T19:45:20.1688871Z 2025-05-07T19:45:20.1688874Z 2025-05-07T19:45:20.1689262Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.1689592Z 2025-05-07T19:45:20.1689597Z 2025-05-07T19:45:20.1689601Z 2025-05-07T19:45:20.1689604Z 2025-05-07T19:45:20.1689608Z 2025-05-07T19:45:20.1689611Z 2025-05-07T19:45:20.1689615Z 2025-05-07T19:45:20.1689618Z 2025-05-07T19:45:20.2826127Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.3966319Z openjdk-23.0.1 | 
181.3 MB | #######4 | 74% 2025-05-07T19:45:20.4841639Z openjdk-23.0.1 | 181.3 MB | #######7 | 78% 2025-05-07T19:45:20.4843020Z 2025-05-07T19:45:20.4843034Z 2025-05-07T19:45:20.4843047Z 2025-05-07T19:45:20.4843057Z 2025-05-07T19:45:20.4843068Z 2025-05-07T19:45:20.4843078Z 2025-05-07T19:45:20.4843110Z 2025-05-07T19:45:20.4843121Z 2025-05-07T19:45:20.4843132Z 2025-05-07T19:45:20.4844129Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.4844716Z 2025-05-07T19:45:20.4844720Z 2025-05-07T19:45:20.4844724Z 2025-05-07T19:45:20.4844727Z 2025-05-07T19:45:20.4844731Z 2025-05-07T19:45:20.4844734Z 2025-05-07T19:45:20.4844738Z 2025-05-07T19:45:20.4844742Z 2025-05-07T19:45:20.4844746Z 2025-05-07T19:45:20.4966014Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.5437435Z openjdk-23.0.1 | 181.3 MB | ########2 | 82% 2025-05-07T19:45:20.5438274Z 2025-05-07T19:45:20.5438289Z 2025-05-07T19:45:20.5438301Z 2025-05-07T19:45:20.5438312Z 2025-05-07T19:45:20.5438358Z 2025-05-07T19:45:20.5438369Z 2025-05-07T19:45:20.5438379Z 2025-05-07T19:45:20.5438389Z 2025-05-07T19:45:20.5438399Z 2025-05-07T19:45:20.5438410Z 2025-05-07T19:45:20.5438420Z 2025-05-07T19:45:20.5440931Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.5441894Z 2025-05-07T19:45:20.5441905Z 2025-05-07T19:45:20.5441916Z 2025-05-07T19:45:20.5441927Z 2025-05-07T19:45:20.5441938Z 2025-05-07T19:45:20.5441949Z 2025-05-07T19:45:20.5441959Z 2025-05-07T19:45:20.5441969Z 2025-05-07T19:45:20.5441979Z 2025-05-07T19:45:20.5441989Z 2025-05-07T19:45:20.5442011Z 2025-05-07T19:45:20.5967717Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.6967984Z openjdk-23.0.1 | 181.3 MB | ########6 | 86% 2025-05-07T19:45:20.7971518Z openjdk-23.0.1 | 181.3 MB | #########1 | 92% 2025-05-07T19:45:20.9724425Z openjdk-23.0.1 | 181.3 MB | #########9 | 99% 2025-05-07T19:45:20.9725278Z 2025-05-07T19:45:20.9725293Z 2025-05-07T19:45:20.9725304Z 2025-05-07T19:45:20.9725314Z 
2025-05-07T19:45:20.9725326Z 2025-05-07T19:45:20.9725337Z 2025-05-07T19:45:20.9725347Z 2025-05-07T19:45:20.9725357Z 2025-05-07T19:45:20.9725808Z 2025-05-07T19:45:20.9725856Z 2025-05-07T19:45:21.0733698Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:21.0734197Z 2025-05-07T19:45:21.0734202Z 2025-05-07T19:45:21.0734205Z 2025-05-07T19:45:21.0734209Z 2025-05-07T19:45:21.0734213Z 2025-05-07T19:45:21.0734216Z 2025-05-07T19:45:21.0734246Z 2025-05-07T19:45:21.0734250Z 2025-05-07T19:45:21.0734254Z 2025-05-07T19:45:21.0734258Z 2025-05-07T19:45:21.0734261Z 2025-05-07T19:45:21.0734265Z 2025-05-07T19:45:21.0734551Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.0734857Z 2025-05-07T19:45:21.0734861Z 2025-05-07T19:45:21.0734865Z 2025-05-07T19:45:21.0734868Z 2025-05-07T19:45:21.0734900Z 2025-05-07T19:45:21.0734922Z 2025-05-07T19:45:21.0734926Z 2025-05-07T19:45:21.0734930Z 2025-05-07T19:45:21.0734933Z 2025-05-07T19:45:21.0734937Z 2025-05-07T19:45:21.0734940Z 2025-05-07T19:45:21.0734943Z 2025-05-07T19:45:21.1584432Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.1584800Z 2025-05-07T19:45:21.1584806Z 2025-05-07T19:45:21.1584809Z 2025-05-07T19:45:21.1584813Z 2025-05-07T19:45:21.1584935Z 2025-05-07T19:45:21.1584946Z 2025-05-07T19:45:21.1584952Z 2025-05-07T19:45:21.1584957Z 2025-05-07T19:45:21.1584962Z 2025-05-07T19:45:21.1584967Z 2025-05-07T19:45:21.1584972Z 2025-05-07T19:45:21.1584976Z 2025-05-07T19:45:21.1584990Z 2025-05-07T19:45:21.1585605Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.1585983Z 2025-05-07T19:45:21.1585988Z 2025-05-07T19:45:21.1585992Z 2025-05-07T19:45:21.1585997Z 2025-05-07T19:45:21.1586000Z 2025-05-07T19:45:21.1586004Z 2025-05-07T19:45:21.1586266Z 2025-05-07T19:45:21.1586269Z 2025-05-07T19:45:21.1586273Z 2025-05-07T19:45:21.1586276Z 2025-05-07T19:45:21.1586280Z 2025-05-07T19:45:21.1586284Z 2025-05-07T19:45:21.1586287Z 2025-05-07T19:45:21.7273030Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  
2025-05-07T19:45:21.7274321Z 2025-05-07T19:45:21.7274337Z 2025-05-07T19:45:21.7274347Z 2025-05-07T19:45:21.7274358Z 2025-05-07T19:45:21.7274368Z 2025-05-07T19:45:21.7274378Z 2025-05-07T19:45:21.7274388Z 2025-05-07T19:45:21.7274399Z 2025-05-07T19:45:21.7274409Z 2025-05-07T19:45:21.7274421Z 2025-05-07T19:45:21.7274431Z 2025-05-07T19:45:21.7274442Z 2025-05-07T19:45:21.7274452Z 2025-05-07T19:45:21.7274496Z 2025-05-07T19:45:21.7274507Z 2025-05-07T19:45:21.7275474Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:21.7276475Z 2025-05-07T19:45:21.7276486Z 2025-05-07T19:45:21.7276496Z 2025-05-07T19:45:21.7276507Z 2025-05-07T19:45:21.7276536Z 2025-05-07T19:45:21.7276546Z 2025-05-07T19:45:21.7276557Z 2025-05-07T19:45:21.7276596Z 2025-05-07T19:45:21.7276606Z 2025-05-07T19:45:21.7276617Z 2025-05-07T19:45:21.7276627Z 2025-05-07T19:45:21.7276650Z 2025-05-07T19:45:21.7276660Z 2025-05-07T19:45:21.7276671Z 2025-05-07T19:45:21.7276681Z 2025-05-07T19:45:21.7602295Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:21.7603344Z 2025-05-07T19:45:21.7603406Z 2025-05-07T19:45:21.8296248Z python-3.11.11 | 29.2 MB | ########## | 100%  2025-05-07T19:45:21.8297113Z 2025-05-07T19:45:21.8297127Z 2025-05-07T19:45:21.8297138Z 2025-05-07T19:45:21.8297183Z 2025-05-07T19:45:21.8297194Z 2025-05-07T19:45:21.8297204Z 2025-05-07T19:45:21.8297215Z 2025-05-07T19:45:21.8297225Z 2025-05-07T19:45:21.8297236Z 2025-05-07T19:45:21.8297246Z 2025-05-07T19:45:21.8297257Z 2025-05-07T19:45:21.8297267Z 2025-05-07T19:45:21.8297277Z 2025-05-07T19:45:21.8297322Z 2025-05-07T19:45:21.8297333Z 2025-05-07T19:45:21.8297343Z 2025-05-07T19:45:21.8298198Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:21.8299125Z 2025-05-07T19:45:21.8299578Z 2025-05-07T19:45:21.8299592Z 2025-05-07T19:45:21.8299603Z 2025-05-07T19:45:21.8299613Z 2025-05-07T19:45:21.8299624Z 2025-05-07T19:45:21.8299634Z 2025-05-07T19:45:21.8299645Z 2025-05-07T19:45:21.8299655Z 2025-05-07T19:45:21.8299665Z 
2025-05-07T19:45:21.8880647Z cairo-1.18.0 | 961 KB | ########## | 100% 2025-05-07T19:45:21.8882664Z krb5-1.21.3 | 1.3 MB | ########## | 100% 2025-05-07T19:45:21.9206477Z krb5-1.21.3 | 1.3 MB | ########## | 100% 2025-05-07T19:45:21.9208846Z libsqlite-3.49.2 | 895 KB | ########## | 100% 2025-05-07T19:45:21.9914527Z libsqlite-3.49.2 | 895 KB | ########## | 100% 2025-05-07T19:45:22.0167311Z cmake-4.0.2 | 19.4 MB | ########## | 100% 2025-05-07T19:45:22.0170418Z pcre2-10.44 | 934 KB | ########## | 100% 2025-05-07T19:45:22.6952576Z pcre2-10.44 | 934 KB | ########## | 100% 2025-05-07T19:45:23.1215534Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:23.1218211Z ... (more hidden) ... 2025-05-07T19:45:23.4307032Z ... (more hidden) ... 2025-05-07T19:45:24.4718787Z bazel-7.5.0 | 47.4 MB | ########## | 100% 2025-05-07T19:45:24.4722783Z openjdk-23.0.1 | 181.3 MB | ########## | 100%
2025-05-07T19:45:24.4791250Z  done 2025-05-07T19:45:24.7862900Z Preparing transaction: | / - done 2025-05-07T19:45:28.5240290Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:31.0390638Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:45:31.4709461Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:33.1197059Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:33.1197691Z 2025-05-07T19:45:33.1221946Z 2025-05-07T19:45:33.1250705Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:35.1615202Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:45:35.1618898Z Collecting build 2025-05-07T19:45:35.1619293Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:35.1620079Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build) (25.0) 2025-05-07T19:45:35.1620826Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:35.1621284Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:35.1622147Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:35.1622626Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:35.1623067Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:35.1623359Z 2025-05-07T19:45:35.1623557Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:35.1623854Z 2025-05-07T19:45:35.1623877Z 2025-05-07T19:45:36.8184067Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:36.8184889Z 2025-05-07T19:45:36.8909679Z [CHECK] Binary make found in PATH 2025-05-07T19:45:38.4793109Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:38.4793526Z 2025-05-07T19:45:38.5378542Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:40.1354564Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:40.1355474Z 2025-05-07T19:45:40.1918411Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:41.8724443Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:43.6613662Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:45.3690508Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:47.1436792Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:48.7912541Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:48.7913072Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:48.7987242Z ##[group]Run . 
$PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:48.7987749Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:48.7988392Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:48.7988771Z env: 2025-05-07T19:45:48.7989046Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:48.7989394Z BUILD_ENV: build_binary 2025-05-07T19:45:48.7989846Z BUILD_TARGET: default 2025-05-07T19:45:48.7990134Z BUILD_VARIANT: cuda 2025-05-07T19:45:48.7990414Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:48.7990686Z ##[endgroup] 2025-05-07T19:45:49.2210041Z ################################################################################ 2025-05-07T19:45:49.2211055Z # Install CUDA 2025-05-07T19:45:49.2211665Z # 2025-05-07T19:45:49.2227566Z # [2025-05-07T19:45:49.222Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:49.2228905Z ################################################################################ 2025-05-07T19:45:49.2229245Z 2025-05-07T19:45:49.2243060Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:49.3074235Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:49.3075336Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:49.3076296Z + conda clean --packages --tarball -y 2025-05-07T19:45:49.3076909Z 2025-05-07T19:45:49.8556973Z Will remove 147 (628.1 MB) tarball(s). 2025-05-07T19:45:49.8558035Z Will remove 21 (102.9 MB) package(s). 2025-05-07T19:45:49.9118031Z 2025-05-07T19:45:49.9124386Z + conda clean --all -y 2025-05-07T19:45:49.9124593Z 2025-05-07T19:45:50.5217652Z There are no unused tarball(s) to remove. 2025-05-07T19:45:50.5218230Z Will remove 1 index cache(s). 2025-05-07T19:45:50.5218569Z There are no unused package(s) to remove. 2025-05-07T19:45:50.5218906Z There are no tempfile(s) to remove. 2025-05-07T19:45:50.5219245Z There are no logfile(s) to remove. 2025-05-07T19:45:50.5775027Z 2025-05-07T19:45:50.5785048Z [INSTALL] Installing CUDA 12.6.3 ... 
2025-05-07T19:45:50.5814012Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:51.4134029Z Channels: 2025-05-07T19:45:51.4135309Z - conda-forge 2025-05-07T19:45:51.4136111Z Platform: linux-64 2025-05-07T19:46:01.1501424Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:46:02.6684014Z Solving environment: | / - \ done 2025-05-07T19:46:02.8148771Z 2025-05-07T19:46:02.8149310Z ## Package Plan ## 2025-05-07T19:46:02.8149839Z 2025-05-07T19:46:02.8150472Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:02.8151442Z 2025-05-07T19:46:02.8151762Z added / updated specs: 2025-05-07T19:46:02.8152524Z - cuda=12.6.3 2025-05-07T19:46:02.8152702Z 2025-05-07T19:46:02.8152707Z 2025-05-07T19:46:02.8152848Z The following packages will be downloaded: 2025-05-07T19:46:02.8153087Z 2025-05-07T19:46:02.8153260Z package | build 2025-05-07T19:46:02.8153644Z ---------------------------|----------------- 2025-05-07T19:46:02.8154211Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:46:02.8154709Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:46:02.8155233Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:46:02.8155717Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:46:02.8156212Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:46:02.8156778Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:46:02.8157486Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:46:02.8158008Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:46:02.8158958Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:46:02.8159471Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:02.8160007Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:02.8160582Z 
cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:46:02.8161148Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:02.8161913Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:46:02.8162490Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:46:02.8163056Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:46:02.8163680Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:46:02.8164200Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:46:02.8164725Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:46:02.8165227Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:02.8165794Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:46:02.8166304Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:46:02.8166926Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:02.8167416Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:02.8167935Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:46:02.8168455Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:46:02.8168929Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:46:02.8169449Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:46:02.8169924Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:46:02.8170441Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:46:02.8170952Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:46:02.8171419Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:46:02.8171913Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:46:02.8172378Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:46:02.8172862Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB 
conda-forge 2025-05-07T19:46:02.8173318Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:46:02.8173805Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:46:02.8174303Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:46:02.8174794Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:46:02.8175291Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:46:02.8175744Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:46:02.8176213Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:46:02.8176676Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:46:02.8177186Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:46:02.8177684Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:02.8178169Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:46:02.8178811Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:02.8179259Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:02.8179726Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:46:02.8180230Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:02.8180705Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:46:02.8181243Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:46:02.8181643Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:46:02.8182066Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:02.8182477Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:46:02.8182927Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:46:02.8183346Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:02.8183748Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:46:02.8184213Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:46:02.8184665Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB 
conda-forge 2025-05-07T19:46:02.8185146Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:46:02.8185622Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:46:02.8186071Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:46:02.8186551Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:46:02.8187010Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:46:02.8187502Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:46:02.8187970Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:46:02.8188475Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:46:02.8188978Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:46:02.8189454Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:46:02.8189960Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:46:02.8190418Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:46:02.8190874Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:46:02.8191292Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:46:02.8191752Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:46:02.8192219Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:46:02.8192663Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:46:02.8193161Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:46:02.8193635Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:46:02.8194282Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:46:02.8195068Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:46:02.8195572Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:46:02.8196108Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:46:02.8196586Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:46:02.8197202Z 
libxkbcommon-1.7.0 | h2c5496b_1 579 KB conda-forge 2025-05-07T19:46:02.8197690Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:46:02.8198169Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:46:02.8198683Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:46:02.8199165Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:46:02.8199687Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:46:02.8200119Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:46:02.8200746Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:46:02.8201209Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:46:02.8201654Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:46:02.8202097Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:46:02.8202580Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:46:02.8203047Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:46:02.8203554Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:46:02.8204050Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:46:02.8204548Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:46:02.8205015Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:46:02.8205547Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:46:02.8206071Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:46:02.8206509Z ------------------------------------------------------------ 2025-05-07T19:46:02.8206892Z Total: 1.59 GB 2025-05-07T19:46:02.8207118Z 2025-05-07T19:46:02.8207258Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:02.8207525Z 2025-05-07T19:46:02.8207723Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:46:02.8208194Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:46:02.8208676Z c-compiler 
conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:46:02.8209158Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:46:02.8209651Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:46:02.8210302Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:46:02.8210929Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:46:02.8211505Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:02.8212121Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:46:02.8212659Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:46:02.8213236Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:46:02.8213867Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:02.8214501Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:46:02.8215168Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:02.8215793Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:02.8216395Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8216941Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:46:02.8217546Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:46:02.8218115Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8218662Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:46:02.8219274Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:02.8219929Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 
2025-05-07T19:46:02.8220429Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:46:02.8221034Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:46:02.8221594Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:46:02.8222121Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:46:02.8222692Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:46:02.8223267Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:46:02.8223851Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:46:02.8224420Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:46:02.8225009Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8225571Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8226090Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:46:02.8226630Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8227141Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:46:02.8227682Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:46:02.8228222Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:46:02.8229191Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:02.8229853Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:46:02.8230462Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:46:02.8231063Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:46:02.8231628Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 
2025-05-07T19:46:02.8232212Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:46:02.8232868Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:46:02.8233465Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:46:02.8234176Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:46:02.8234886Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:46:02.8235421Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:46:02.8235983Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:46:02.8236571Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:46:02.8237215Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:46:02.8237745Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:46:02.8238189Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:46:02.8238652Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:46:02.8239119Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:46:02.8239610Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:46:02.8240212Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:46:02.8240757Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:46:02.8241529Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:46:02.8242261Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:46:02.8242836Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:46:02.8243469Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:46:02.8244037Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:46:02.8244615Z libcufile-dev 
conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:46:02.8245170Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:46:02.8245759Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:46:02.8246336Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:46:02.8246951Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:46:02.8247568Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:46:02.8248159Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:46:02.8248784Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:46:02.8249346Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:46:02.8249869Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:46:02.8250365Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:46:02.8250877Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:46:02.8251695Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:46:02.8252217Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:46:02.8252833Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:46:02.8253461Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:46:02.8254074Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:46:02.8254692Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:46:02.8255248Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:46:02.8255839Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:46:02.8256374Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:46:02.8256894Z libxkbcommon 
conda-forge/linux-64::libxkbcommon-1.7.0-h2c5496b_1 2025-05-07T19:46:02.8257461Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:46:02.8257940Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:46:02.8258499Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:46:02.8259056Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:46:02.8259476Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:46:02.8259947Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:46:02.8260524Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:46:02.8261111Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:46:02.8261626Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:46:02.8262111Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:46:02.8262773Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:46:02.8263365Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:46:02.8263995Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:46:02.8264779Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:46:02.8265364Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:46:02.8266206Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:46:02.8266848Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:46:02.8267620Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:46:02.8267982Z 2025-05-07T19:46:02.8267987Z 2025-05-07T19:46:02.8267991Z 2025-05-07T19:46:02.8268191Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:02.8268618Z nsight-compute-2024. 
| 443.1 MB | | 0% 2025-05-07T19:46:02.8268922Z 2025-05-07T19:46:02.8269269Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:02.8269540Z 2025-05-07T19:46:02.8269544Z 2025-05-07T19:46:02.8269886Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:46:02.8270165Z 2025-05-07T19:46:02.8270169Z 2025-05-07T19:46:02.8270172Z 2025-05-07T19:46:02.8275269Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:02.8275674Z 2025-05-07T19:46:02.8275677Z 2025-05-07T19:46:02.8275681Z 2025-05-07T19:46:02.8275684Z 2025-05-07T19:46:02.8291341Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:02.8291764Z 2025-05-07T19:46:02.8291943Z 2025-05-07T19:46:02.8291947Z 2025-05-07T19:46:02.8292025Z 2025-05-07T19:46:02.8292056Z 2025-05-07T19:46:02.8293579Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:02.8293932Z 2025-05-07T19:46:02.8293939Z 2025-05-07T19:46:02.8293943Z 2025-05-07T19:46:02.8293973Z 2025-05-07T19:46:02.8294008Z 2025-05-07T19:46:02.8294012Z 2025-05-07T19:46:02.8294338Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:02.8294652Z 2025-05-07T19:46:02.8294656Z 2025-05-07T19:46:02.8294660Z 2025-05-07T19:46:02.8294665Z 2025-05-07T19:46:02.8294669Z 2025-05-07T19:46:02.8294682Z 2025-05-07T19:46:02.8294716Z 2025-05-07T19:46:02.8295251Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:02.8295558Z 2025-05-07T19:46:02.8295563Z 2025-05-07T19:46:02.8295578Z 2025-05-07T19:46:02.8295582Z 2025-05-07T19:46:02.8295586Z 2025-05-07T19:46:02.8295589Z 2025-05-07T19:46:02.8295593Z 2025-05-07T19:46:02.8295626Z 2025-05-07T19:46:02.8296394Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:02.8296709Z 2025-05-07T19:46:02.8296729Z 2025-05-07T19:46:02.8296733Z 2025-05-07T19:46:02.8296736Z 2025-05-07T19:46:02.8296740Z 2025-05-07T19:46:02.8296743Z 2025-05-07T19:46:02.8296754Z 2025-05-07T19:46:02.8296784Z 2025-05-07T19:46:02.8296787Z 2025-05-07T19:46:02.8297386Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:02.8297727Z 2025-05-07T19:46:02.8297742Z 
2025-05-07T19:46:02.8297746Z 2025-05-07T19:46:02.8297749Z 2025-05-07T19:46:02.8297753Z 2025-05-07T19:46:02.8297782Z 2025-05-07T19:46:02.8297786Z 2025-05-07T19:46:02.8297789Z 2025-05-07T19:46:02.8297800Z 2025-05-07T19:46:02.8297804Z 2025-05-07T19:46:02.8298470Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:02.8298782Z 2025-05-07T19:46:02.8298787Z 2025-05-07T19:46:02.8298790Z 2025-05-07T19:46:02.8298819Z 2025-05-07T19:46:02.8298822Z 2025-05-07T19:46:02.8298825Z 2025-05-07T19:46:02.8298829Z 2025-05-07T19:46:02.8298832Z 2025-05-07T19:46:02.8298836Z 2025-05-07T19:46:02.8298839Z 2025-05-07T19:46:02.8298842Z 2025-05-07T19:46:02.8299729Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:02.8300062Z 2025-05-07T19:46:02.8300109Z 2025-05-07T19:46:02.8300113Z 2025-05-07T19:46:02.8300116Z 2025-05-07T19:46:02.8300120Z 2025-05-07T19:46:02.8300123Z 2025-05-07T19:46:02.8300127Z 2025-05-07T19:46:02.8300130Z 2025-05-07T19:46:02.8300134Z 2025-05-07T19:46:02.8300137Z 2025-05-07T19:46:02.8300141Z 2025-05-07T19:46:02.8300144Z 2025-05-07T19:46:02.8300562Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:02.8300993Z 2025-05-07T19:46:02.8301012Z 2025-05-07T19:46:02.8301016Z 2025-05-07T19:46:02.8301019Z 2025-05-07T19:46:02.8301022Z 2025-05-07T19:46:02.8301026Z 2025-05-07T19:46:02.8301030Z 2025-05-07T19:46:02.8301033Z 2025-05-07T19:46:02.8301037Z 2025-05-07T19:46:02.8301040Z 2025-05-07T19:46:02.8301043Z 2025-05-07T19:46:02.8301047Z 2025-05-07T19:46:02.8301050Z 2025-05-07T19:46:02.8301653Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:02.8301988Z 2025-05-07T19:46:02.8302002Z 2025-05-07T19:46:02.8302006Z 2025-05-07T19:46:02.8302009Z 2025-05-07T19:46:02.8302013Z 2025-05-07T19:46:02.8302017Z 2025-05-07T19:46:02.8302020Z 2025-05-07T19:46:02.8302024Z 2025-05-07T19:46:02.8302028Z 2025-05-07T19:46:02.8302031Z 2025-05-07T19:46:02.8302034Z 2025-05-07T19:46:02.8302038Z 2025-05-07T19:46:02.8302041Z 2025-05-07T19:46:02.8302070Z 2025-05-07T19:46:02.8302672Z cuda-nvcc-dev_linux- 
| 10.8 MB | | 0%  2025-05-07T19:46:02.8303015Z 2025-05-07T19:46:02.8303031Z 2025-05-07T19:46:02.8303034Z 2025-05-07T19:46:02.8303038Z 2025-05-07T19:46:02.8303041Z 2025-05-07T19:46:02.8303044Z 2025-05-07T19:46:02.8303072Z 2025-05-07T19:46:02.8303075Z 2025-05-07T19:46:02.8303079Z 2025-05-07T19:46:02.8303082Z 2025-05-07T19:46:02.8303085Z 2025-05-07T19:46:02.8303089Z 2025-05-07T19:46:02.8303092Z 2025-05-07T19:46:02.8303096Z 2025-05-07T19:46:02.8303099Z 2025-05-07T19:46:02.8303691Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:02.8304055Z 2025-05-07T19:46:02.8304071Z 2025-05-07T19:46:02.8304076Z 2025-05-07T19:46:02.8304079Z 2025-05-07T19:46:02.8304083Z 2025-05-07T19:46:02.8304086Z 2025-05-07T19:46:02.8304089Z 2025-05-07T19:46:02.8304093Z 2025-05-07T19:46:02.8304096Z 2025-05-07T19:46:02.8304099Z 2025-05-07T19:46:02.8304103Z 2025-05-07T19:46:02.8304110Z 2025-05-07T19:46:02.8304114Z 2025-05-07T19:46:02.8304117Z 2025-05-07T19:46:02.8304121Z 2025-05-07T19:46:02.8304124Z 2025-05-07T19:46:02.8304785Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:02.8305132Z 2025-05-07T19:46:02.8305150Z 2025-05-07T19:46:02.8305153Z 2025-05-07T19:46:02.8305157Z 2025-05-07T19:46:02.8305160Z 2025-05-07T19:46:02.8305164Z 2025-05-07T19:46:02.8305167Z 2025-05-07T19:46:02.8305171Z 2025-05-07T19:46:02.8305174Z 2025-05-07T19:46:02.8305178Z 2025-05-07T19:46:02.8305181Z 2025-05-07T19:46:02.8305189Z 2025-05-07T19:46:02.8305193Z 2025-05-07T19:46:02.8305221Z 2025-05-07T19:46:02.8305224Z 2025-05-07T19:46:02.8305228Z 2025-05-07T19:46:02.8305231Z 2025-05-07T19:46:02.8305899Z cuda-nvvm-impl-12.6. 
| 7.7 MB | | 0%  2025-05-07T19:46:02.8306235Z 2025-05-07T19:46:02.8306239Z 2025-05-07T19:46:02.8306242Z 2025-05-07T19:46:02.8306287Z 2025-05-07T19:46:02.8306296Z 2025-05-07T19:46:02.8306300Z 2025-05-07T19:46:02.8306303Z 2025-05-07T19:46:02.8306307Z 2025-05-07T19:46:02.8306310Z 2025-05-07T19:46:02.8306314Z 2025-05-07T19:46:02.8306317Z 2025-05-07T19:46:02.8306321Z 2025-05-07T19:46:02.8306324Z 2025-05-07T19:46:02.8306327Z 2025-05-07T19:46:02.8306331Z 2025-05-07T19:46:02.8306334Z 2025-05-07T19:46:02.8306338Z 2025-05-07T19:46:02.8306341Z 2025-05-07T19:46:02.8306947Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:02.8307321Z 2025-05-07T19:46:02.8307414Z 2025-05-07T19:46:02.8307418Z 2025-05-07T19:46:02.8307422Z 2025-05-07T19:46:02.8307425Z 2025-05-07T19:46:02.8307429Z 2025-05-07T19:46:02.8307432Z 2025-05-07T19:46:02.8307436Z 2025-05-07T19:46:02.8307439Z 2025-05-07T19:46:02.8307443Z 2025-05-07T19:46:02.8307446Z 2025-05-07T19:46:02.8307450Z 2025-05-07T19:46:02.8307453Z 2025-05-07T19:46:02.8307457Z 2025-05-07T19:46:02.8307460Z 2025-05-07T19:46:02.8307523Z 2025-05-07T19:46:02.8307553Z 2025-05-07T19:46:02.8307556Z 2025-05-07T19:46:02.8307560Z 2025-05-07T19:46:02.9246548Z ... (more hidden) ... 2025-05-07T19:46:02.9249746Z nsight-compute-2024. | 443.1 MB | | 1% 2025-05-07T19:46:02.9253067Z 2025-05-07T19:46:02.9267902Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:02.9268243Z 2025-05-07T19:46:02.9268248Z 2025-05-07T19:46:02.9268252Z 2025-05-07T19:46:02.9269329Z libcusparse-12.5.4.2 | 118.6 MB | 7 | 7%  2025-05-07T19:46:02.9269664Z 2025-05-07T19:46:02.9269676Z 2025-05-07T19:46:02.9426203Z libcufft-11.3.0.4 | 156.2 MB | 1 | 1%  2025-05-07T19:46:02.9427048Z 2025-05-07T19:46:02.9427062Z 2025-05-07T19:46:02.9427073Z 2025-05-07T19:46:02.9427084Z 2025-05-07T19:46:03.0247697Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:03.0250355Z nsight-compute-2024. 
| 443.1 MB | 2 | 3% 2025-05-07T19:46:03.0251152Z 2025-05-07T19:46:03.0268847Z libcublas-12.6.4.1 | 256.2 MB | 2 | 3%  2025-05-07T19:46:03.0269598Z 2025-05-07T19:46:03.0269603Z 2025-05-07T19:46:03.0269607Z 2025-05-07T19:46:03.0271295Z libcusparse-12.5.4.2 | 118.6 MB | #8 | 18%  2025-05-07T19:46:03.0271586Z 2025-05-07T19:46:03.0271597Z 2025-05-07T19:46:03.0427683Z libcufft-11.3.0.4 | 156.2 MB | 5 | 6%  2025-05-07T19:46:03.0428050Z 2025-05-07T19:46:03.0428055Z 2025-05-07T19:46:03.0428085Z 2025-05-07T19:46:03.0428089Z 2025-05-07T19:46:03.1253383Z cuda-nsight-12.6.77 | 113.2 MB | 1 | 2%  2025-05-07T19:46:03.1253874Z nsight-compute-2024. | 443.1 MB | 4 | 4% 2025-05-07T19:46:03.1254166Z 2025-05-07T19:46:03.1275942Z libcublas-12.6.4.1 | 256.2 MB | 5 | 5%  2025-05-07T19:46:03.1276233Z 2025-05-07T19:46:03.1276248Z 2025-05-07T19:46:03.1429458Z libcufft-11.3.0.4 | 156.2 MB | # | 10%  2025-05-07T19:46:03.1429851Z 2025-05-07T19:46:03.1429856Z 2025-05-07T19:46:03.1429863Z 2025-05-07T19:46:03.1429868Z 2025-05-07T19:46:03.1921092Z cuda-nsight-12.6.77 | 113.2 MB | 4 | 5%  2025-05-07T19:46:03.1921426Z 2025-05-07T19:46:03.1921431Z 2025-05-07T19:46:03.1921446Z 2025-05-07T19:46:03.2258019Z libcusparse-12.5.4.2 | 118.6 MB | ##7 | 28%  2025-05-07T19:46:03.2258931Z 2025-05-07T19:46:03.2299540Z libcublas-12.6.4.1 | 256.2 MB | 7 | 8%  2025-05-07T19:46:03.2300407Z 2025-05-07T19:46:03.2300421Z 2025-05-07T19:46:03.2384021Z libcufft-11.3.0.4 | 156.2 MB | #4 | 14%  2025-05-07T19:46:03.2429566Z nsight-compute-2024. 
| 443.1 MB | 5 | 6% 2025-05-07T19:46:03.2429881Z 2025-05-07T19:46:03.2429887Z 2025-05-07T19:46:03.2429891Z 2025-05-07T19:46:03.2429894Z 2025-05-07T19:46:03.3258465Z cuda-nsight-12.6.77 | 113.2 MB | 9 | 9%  2025-05-07T19:46:03.3258836Z 2025-05-07T19:46:03.3300216Z libcublas-12.6.4.1 | 256.2 MB | # | 10%  2025-05-07T19:46:03.3300568Z 2025-05-07T19:46:03.3300575Z 2025-05-07T19:46:03.3414774Z libcufft-11.3.0.4 | 156.2 MB | #8 | 18%  2025-05-07T19:46:03.3415621Z 2025-05-07T19:46:03.3415635Z 2025-05-07T19:46:03.3415647Z 2025-05-07T19:46:03.3428887Z libcusparse-12.5.4.2 | 118.6 MB | ###5 | 35%  2025-05-07T19:46:03.3429220Z 2025-05-07T19:46:03.3429225Z 2025-05-07T19:46:03.3429230Z 2025-05-07T19:46:03.3429235Z 2025-05-07T19:46:03.3509345Z cuda-nsight-12.6.77 | 113.2 MB | #3 | 13%  2025-05-07T19:46:03.4259936Z nsight-compute-2024. | 443.1 MB | 7 | 7% 2025-05-07T19:46:03.4260228Z 2025-05-07T19:46:03.4301819Z libcublas-12.6.4.1 | 256.2 MB | #2 | 12%  2025-05-07T19:46:03.4302659Z 2025-05-07T19:46:03.4302674Z 2025-05-07T19:46:03.4431343Z libcufft-11.3.0.4 | 156.2 MB | ##1 | 22%  2025-05-07T19:46:03.4431872Z 2025-05-07T19:46:03.4431879Z 2025-05-07T19:46:03.4432100Z 2025-05-07T19:46:03.4432103Z 2025-05-07T19:46:03.4555395Z cuda-nsight-12.6.77 | 113.2 MB | #7 | 17%  2025-05-07T19:46:03.4644793Z nsight-compute-2024. | 443.1 MB | 8 | 9% 2025-05-07T19:46:03.4645610Z 2025-05-07T19:46:03.4645626Z 2025-05-07T19:46:03.4645638Z 2025-05-07T19:46:03.5259418Z libcusparse-12.5.4.2 | 118.6 MB | ####2 | 42%  2025-05-07T19:46:03.5259738Z 2025-05-07T19:46:03.5430725Z libcublas-12.6.4.1 | 256.2 MB | #4 | 15%  2025-05-07T19:46:03.5431054Z 2025-05-07T19:46:03.5431061Z 2025-05-07T19:46:03.5431112Z 2025-05-07T19:46:03.5431120Z 2025-05-07T19:46:03.5557459Z cuda-nsight-12.6.77 | 113.2 MB | ##1 | 22%  2025-05-07T19:46:03.5581758Z nsight-compute-2024. 
| 443.1 MB | # | 10% 2025-05-07T19:46:03.5582137Z 2025-05-07T19:46:03.5582348Z 2025-05-07T19:46:03.5828887Z libcufft-11.3.0.4 | 156.2 MB | ##5 | 26%  2025-05-07T19:46:03.5829180Z 2025-05-07T19:46:03.5829216Z 2025-05-07T19:46:03.5829253Z 2025-05-07T19:46:03.6268947Z libcusparse-12.5.4.2 | 118.6 MB | ####8 | 48%  2025-05-07T19:46:03.6269289Z 2025-05-07T19:46:03.6432761Z libcublas-12.6.4.1 | 256.2 MB | #7 | 17%  2025-05-07T19:46:03.6433302Z 2025-05-07T19:46:03.6433308Z 2025-05-07T19:46:03.6433312Z 2025-05-07T19:46:03.6433659Z 2025-05-07T19:46:03.6584649Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 26%  2025-05-07T19:46:03.6584977Z 2025-05-07T19:46:03.6584983Z 2025-05-07T19:46:03.6639594Z libcufft-11.3.0.4 | 156.2 MB | ##9 | 29%  2025-05-07T19:46:03.6983858Z nsight-compute-2024. | 443.1 MB | #1 | 11% 2025-05-07T19:46:03.6984347Z 2025-05-07T19:46:03.6984398Z 2025-05-07T19:46:03.6984406Z 2025-05-07T19:46:03.7291622Z libcusparse-12.5.4.2 | 118.6 MB | #####4 | 54%  2025-05-07T19:46:03.7291964Z 2025-05-07T19:46:03.7433890Z libcublas-12.6.4.1 | 256.2 MB | #9 | 20%  2025-05-07T19:46:03.7434413Z 2025-05-07T19:46:03.7434457Z 2025-05-07T19:46:03.7434461Z 2025-05-07T19:46:03.7434465Z 2025-05-07T19:46:03.7585509Z cuda-nsight-12.6.77 | 113.2 MB | ##9 | 30%  2025-05-07T19:46:03.7585843Z 2025-05-07T19:46:03.7585977Z 2025-05-07T19:46:03.7640111Z libcufft-11.3.0.4 | 156.2 MB | ###3 | 33%  2025-05-07T19:46:03.8095408Z nsight-compute-2024. 
| 443.1 MB | #2 | 13% 2025-05-07T19:46:03.8095747Z 2025-05-07T19:46:03.8095755Z 2025-05-07T19:46:03.8095760Z 2025-05-07T19:46:03.8292573Z libcusparse-12.5.4.2 | 118.6 MB | #####9 | 60%  2025-05-07T19:46:03.8292929Z 2025-05-07T19:46:03.8433295Z libcublas-12.6.4.1 | 256.2 MB | ##1 | 22%  2025-05-07T19:46:03.8433621Z 2025-05-07T19:46:03.8433773Z 2025-05-07T19:46:03.8433789Z 2025-05-07T19:46:03.8433797Z 2025-05-07T19:46:03.8592334Z cuda-nsight-12.6.77 | 113.2 MB | ###4 | 34%  2025-05-07T19:46:03.8593240Z 2025-05-07T19:46:03.8593255Z 2025-05-07T19:46:03.8640236Z libcufft-11.3.0.4 | 156.2 MB | ###7 | 37%  2025-05-07T19:46:03.9182904Z nsight-compute-2024. | 443.1 MB | #4 | 14% 2025-05-07T19:46:03.9183726Z 2025-05-07T19:46:03.9183742Z 2025-05-07T19:46:03.9183754Z 2025-05-07T19:46:03.9293011Z libcusparse-12.5.4.2 | 118.6 MB | ######5 | 65%  2025-05-07T19:46:03.9293909Z 2025-05-07T19:46:03.9557438Z libcublas-12.6.4.1 | 256.2 MB | ##4 | 25%  2025-05-07T19:46:03.9557769Z 2025-05-07T19:46:03.9557775Z 2025-05-07T19:46:03.9557780Z 2025-05-07T19:46:03.9557785Z 2025-05-07T19:46:03.9642974Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 38%  2025-05-07T19:46:04.0021985Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:46:04.0022811Z 2025-05-07T19:46:04.0022826Z 2025-05-07T19:46:04.0183157Z libcufft-11.3.0.4 | 156.2 MB | #### | 41%  2025-05-07T19:46:04.0183479Z 2025-05-07T19:46:04.0183654Z 2025-05-07T19:46:04.0183659Z 2025-05-07T19:46:04.0303224Z libcusparse-12.5.4.2 | 118.6 MB | #######4 | 74%  2025-05-07T19:46:04.0303846Z 2025-05-07T19:46:04.0557478Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 27%  2025-05-07T19:46:04.0557784Z 2025-05-07T19:46:04.0557788Z 2025-05-07T19:46:04.0557792Z 2025-05-07T19:46:04.0557796Z 2025-05-07T19:46:04.0642248Z cuda-nsight-12.6.77 | 113.2 MB | ####2 | 42%  2025-05-07T19:46:04.1185277Z nsight-compute-2024. 
| 443.1 MB | #8 | 18% 2025-05-07T19:46:04.1186128Z 2025-05-07T19:46:04.1186145Z 2025-05-07T19:46:04.1186158Z 2025-05-07T19:46:04.1326291Z libcusparse-12.5.4.2 | 118.6 MB | ######## | 81%  2025-05-07T19:46:04.1327220Z 2025-05-07T19:46:04.1527940Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 29%  2025-05-07T19:46:04.1528247Z 2025-05-07T19:46:04.1528665Z 2025-05-07T19:46:04.1558170Z libcufft-11.3.0.4 | 156.2 MB | ####4 | 44%  2025-05-07T19:46:04.1558493Z 2025-05-07T19:46:04.1558506Z 2025-05-07T19:46:04.1558510Z 2025-05-07T19:46:04.1558518Z 2025-05-07T19:46:04.1774700Z cuda-nsight-12.6.77 | 113.2 MB | ####6 | 47%  2025-05-07T19:46:04.2341468Z nsight-compute-2024. | 443.1 MB | ## | 20% 2025-05-07T19:46:04.2342278Z 2025-05-07T19:46:04.2411465Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 32%  2025-05-07T19:46:04.2411888Z 2025-05-07T19:46:04.2412013Z 2025-05-07T19:46:04.2412019Z 2025-05-07T19:46:04.2530445Z libcusparse-12.5.4.2 | 118.6 MB | ########7 | 87%  2025-05-07T19:46:04.2530762Z 2025-05-07T19:46:04.2530768Z 2025-05-07T19:46:04.2562504Z libcufft-11.3.0.4 | 156.2 MB | ####8 | 48%  2025-05-07T19:46:04.2563352Z 2025-05-07T19:46:04.2563366Z 2025-05-07T19:46:04.2563377Z 2025-05-07T19:46:04.2568592Z 2025-05-07T19:46:04.2942133Z cuda-nsight-12.6.77 | 113.2 MB | #####1 | 51%  2025-05-07T19:46:04.3353125Z nsight-compute-2024. | 443.1 MB | ##1 | 22% 2025-05-07T19:46:04.3353785Z 2025-05-07T19:46:04.3531665Z libcublas-12.6.4.1 | 256.2 MB | ###4 | 34%  2025-05-07T19:46:04.3532025Z 2025-05-07T19:46:04.3532031Z 2025-05-07T19:46:04.3560881Z libcufft-11.3.0.4 | 156.2 MB | #####1 | 52%  2025-05-07T19:46:04.3561186Z 2025-05-07T19:46:04.3561191Z 2025-05-07T19:46:04.3561196Z 2025-05-07T19:46:04.3561200Z 2025-05-07T19:46:04.3571054Z cuda-nsight-12.6.77 | 113.2 MB | #####5 | 56%  2025-05-07T19:46:04.3571360Z 2025-05-07T19:46:04.3571364Z 2025-05-07T19:46:04.3572369Z 2025-05-07T19:46:04.4159546Z libcusparse-12.5.4.2 | 118.6 MB | #########3 | 93%  2025-05-07T19:46:04.4361646Z nsight-compute-2024. 
| 443.1 MB | ##3 | 23% 2025-05-07T19:46:04.4362445Z 2025-05-07T19:46:04.4532771Z libcublas-12.6.4.1 | 256.2 MB | ###6 | 37%  2025-05-07T19:46:04.4533068Z 2025-05-07T19:46:04.4533073Z 2025-05-07T19:46:04.4578542Z libcufft-11.3.0.4 | 156.2 MB | #####5 | 56%  2025-05-07T19:46:04.4578870Z 2025-05-07T19:46:04.4578875Z 2025-05-07T19:46:04.4578879Z 2025-05-07T19:46:04.4578884Z 2025-05-07T19:46:04.4705932Z cuda-nsight-12.6.77 | 113.2 MB | #####9 | 60%  2025-05-07T19:46:04.4706281Z 2025-05-07T19:46:04.4706286Z 2025-05-07T19:46:04.4706290Z 2025-05-07T19:46:04.5158140Z libcusparse-12.5.4.2 | 118.6 MB | #########9 | 99%  2025-05-07T19:46:04.5360388Z nsight-compute-2024. | 443.1 MB | ##5 | 25% 2025-05-07T19:46:04.5360690Z 2025-05-07T19:46:04.5535154Z libcublas-12.6.4.1 | 256.2 MB | ###9 | 39%  2025-05-07T19:46:04.5535471Z 2025-05-07T19:46:04.5535738Z 2025-05-07T19:46:04.5579428Z libcufft-11.3.0.4 | 156.2 MB | ###### | 60%  2025-05-07T19:46:04.5579811Z 2025-05-07T19:46:04.5579817Z 2025-05-07T19:46:04.5579821Z 2025-05-07T19:46:04.5579825Z 2025-05-07T19:46:04.6158853Z cuda-nsight-12.6.77 | 113.2 MB | ######4 | 65%  2025-05-07T19:46:04.6362663Z nsight-compute-2024. | 443.1 MB | ##6 | 27% 2025-05-07T19:46:04.6362977Z 2025-05-07T19:46:04.6583906Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 42%  2025-05-07T19:46:04.6584470Z 2025-05-07T19:46:04.6584479Z 2025-05-07T19:46:04.6584483Z 2025-05-07T19:46:04.6584488Z 2025-05-07T19:46:04.6645627Z cuda-nsight-12.6.77 | 113.2 MB | ####### | 71%  2025-05-07T19:46:04.6645957Z 2025-05-07T19:46:04.6645962Z 2025-05-07T19:46:04.7159898Z libcufft-11.3.0.4 | 156.2 MB | ######4 | 64%  2025-05-07T19:46:04.7365750Z nsight-compute-2024. 
| 443.1 MB | ##8 | 28% 2025-05-07T19:46:04.7366356Z 2025-05-07T19:46:04.7585907Z libcublas-12.6.4.1 | 256.2 MB | ####5 | 45%  2025-05-07T19:46:04.7586234Z 2025-05-07T19:46:04.7586407Z 2025-05-07T19:46:04.7586420Z 2025-05-07T19:46:04.7586424Z 2025-05-07T19:46:04.7735516Z cuda-nsight-12.6.77 | 113.2 MB | #######6 | 76%  2025-05-07T19:46:04.7735860Z 2025-05-07T19:46:04.7736054Z 2025-05-07T19:46:04.8160415Z libcufft-11.3.0.4 | 156.2 MB | ######7 | 68%  2025-05-07T19:46:04.8369802Z nsight-compute-2024. | 443.1 MB | ### | 30% 2025-05-07T19:46:04.8370228Z 2025-05-07T19:46:04.8616119Z libcublas-12.6.4.1 | 256.2 MB | ####8 | 48%  2025-05-07T19:46:04.8616462Z 2025-05-07T19:46:04.8616468Z 2025-05-07T19:46:04.8616472Z 2025-05-07T19:46:04.8616476Z 2025-05-07T19:46:04.8736033Z cuda-nsight-12.6.77 | 113.2 MB | ########2 | 82%  2025-05-07T19:46:04.8736354Z 2025-05-07T19:46:04.8736386Z 2025-05-07T19:46:04.9163901Z libcufft-11.3.0.4 | 156.2 MB | #######2 | 72%  2025-05-07T19:46:04.9679093Z nsight-compute-2024. | 443.1 MB | ###2 | 32% 2025-05-07T19:46:04.9679415Z 2025-05-07T19:46:04.9756416Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 51%  2025-05-07T19:46:04.9756713Z 2025-05-07T19:46:04.9756719Z 2025-05-07T19:46:04.9876519Z libcufft-11.3.0.4 | 156.2 MB | #######8 | 78%  2025-05-07T19:46:04.9876873Z 2025-05-07T19:46:04.9876881Z 2025-05-07T19:46:04.9876886Z 2025-05-07T19:46:04.9876891Z 2025-05-07T19:46:05.0443914Z cuda-nsight-12.6.77 | 113.2 MB | ########7 | 87%  2025-05-07T19:46:05.0680293Z nsight-compute-2024. | 443.1 MB | ###3 | 34% 2025-05-07T19:46:05.0680909Z 2025-05-07T19:46:05.0981793Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 54%  2025-05-07T19:46:05.0982609Z 2025-05-07T19:46:05.0982668Z 2025-05-07T19:46:05.0982682Z 2025-05-07T19:46:05.0982745Z 2025-05-07T19:46:05.1465162Z cuda-nsight-12.6.77 | 113.2 MB | #########4 | 94%  2025-05-07T19:46:05.1786314Z nsight-compute-2024. 
| 443.1 MB | ###5 | 36% 2025-05-07T19:46:05.1786634Z 2025-05-07T19:46:05.1954404Z libcublas-12.6.4.1 | 256.2 MB | #####8 | 58%  2025-05-07T19:46:05.1954723Z 2025-05-07T19:46:05.1954729Z 2025-05-07T19:46:05.2465113Z libcufft-11.3.0.4 | 156.2 MB | ########2 | 83%  2025-05-07T19:46:05.2855172Z nsight-compute-2024. | 443.1 MB | ###7 | 37% 2025-05-07T19:46:05.2855477Z 2025-05-07T19:46:05.2855674Z 2025-05-07T19:46:05.2855685Z 2025-05-07T19:46:05.2855690Z 2025-05-07T19:46:05.2892675Z cuda-nsight-12.6.77 | 113.2 MB | #########9 | 100%  2025-05-07T19:46:05.2893056Z 2025-05-07T19:46:05.2893061Z 2025-05-07T19:46:05.2893066Z 2025-05-07T19:46:05.2954785Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:05.2955264Z 2025-05-07T19:46:05.2956076Z libcublas-12.6.4.1 | 256.2 MB | ######1 | 61%  2025-05-07T19:46:05.2956371Z 2025-05-07T19:46:05.2956377Z 2025-05-07T19:46:05.3269148Z libcufft-11.3.0.4 | 156.2 MB | ######### | 90%  2025-05-07T19:46:05.3269479Z 2025-05-07T19:46:05.3269564Z 2025-05-07T19:46:05.3269568Z 2025-05-07T19:46:05.3270386Z 2025-05-07T19:46:05.3270392Z 2025-05-07T19:46:05.3667472Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:05.4044241Z nsight-compute-2024. | 443.1 MB | ###9 | 39% 2025-05-07T19:46:05.4044539Z 2025-05-07T19:46:05.4103132Z libcublas-12.6.4.1 | 256.2 MB | ######4 | 64%  2025-05-07T19:46:05.4103429Z 2025-05-07T19:46:05.4103435Z 2025-05-07T19:46:05.4272454Z libcufft-11.3.0.4 | 156.2 MB | #########4 | 95%  2025-05-07T19:46:05.4272780Z 2025-05-07T19:46:05.4272814Z 2025-05-07T19:46:05.4272820Z 2025-05-07T19:46:05.4272824Z 2025-05-07T19:46:05.4272852Z 2025-05-07T19:46:05.4816568Z cuda-nvvp-12.6.80 | 109.3 MB | 3 | 3%  2025-05-07T19:46:05.5243247Z nsight-compute-2024. 
| 443.1 MB | #### | 41% 2025-05-07T19:46:05.5243545Z 2025-05-07T19:46:05.5243552Z 2025-05-07T19:46:05.5274513Z libcufft-11.3.0.4 | 156.2 MB | #########9 | 99%  2025-05-07T19:46:05.5275387Z 2025-05-07T19:46:05.5275443Z 2025-05-07T19:46:05.5275456Z 2025-05-07T19:46:05.5275468Z 2025-05-07T19:46:05.5275478Z 2025-05-07T19:46:05.5358774Z cuda-nvvp-12.6.80 | 109.3 MB | 9 | 9%  2025-05-07T19:46:05.5359127Z 2025-05-07T19:46:05.5817601Z libcublas-12.6.4.1 | 256.2 MB | ######7 | 67%  2025-05-07T19:46:05.6275226Z nsight-compute-2024. | 443.1 MB | ####2 | 42% 2025-05-07T19:46:05.6275585Z 2025-05-07T19:46:05.6275614Z 2025-05-07T19:46:05.6275618Z 2025-05-07T19:46:05.6275624Z 2025-05-07T19:46:05.6275629Z 2025-05-07T19:46:05.6365934Z cuda-nvvp-12.6.80 | 109.3 MB | #5 | 16%  2025-05-07T19:46:05.6366281Z 2025-05-07T19:46:05.6819821Z libcublas-12.6.4.1 | 256.2 MB | ######9 | 70%  2025-05-07T19:46:05.7275767Z nsight-compute-2024. | 443.1 MB | ####4 | 44% 2025-05-07T19:46:05.7276089Z 2025-05-07T19:46:05.7276095Z 2025-05-07T19:46:05.7276099Z 2025-05-07T19:46:05.7276104Z 2025-05-07T19:46:05.7276107Z 2025-05-07T19:46:05.7403778Z cuda-nvvp-12.6.80 | 109.3 MB | ##2 | 23%  2025-05-07T19:46:05.7404122Z 2025-05-07T19:46:05.7821596Z libcublas-12.6.4.1 | 256.2 MB | #######2 | 73%  2025-05-07T19:46:05.8274985Z nsight-compute-2024. | 443.1 MB | ####5 | 46% 2025-05-07T19:46:05.8275303Z 2025-05-07T19:46:05.8275308Z 2025-05-07T19:46:05.8275313Z 2025-05-07T19:46:05.8275318Z 2025-05-07T19:46:05.8275322Z 2025-05-07T19:46:05.8411234Z cuda-nvvp-12.6.80 | 109.3 MB | ##9 | 29%  2025-05-07T19:46:05.8411602Z 2025-05-07T19:46:05.8822779Z libcublas-12.6.4.1 | 256.2 MB | #######5 | 75%  2025-05-07T19:46:05.9276420Z nsight-compute-2024. 
| 443.1 MB | ####7 | 48% 2025-05-07T19:46:05.9276712Z 2025-05-07T19:46:05.9276717Z 2025-05-07T19:46:05.9276721Z 2025-05-07T19:46:05.9276724Z 2025-05-07T19:46:05.9276729Z 2025-05-07T19:46:05.9413663Z cuda-nvvp-12.6.80 | 109.3 MB | ###6 | 36%  2025-05-07T19:46:05.9414579Z 2025-05-07T19:46:05.9823290Z libcublas-12.6.4.1 | 256.2 MB | #######8 | 78%  2025-05-07T19:46:06.0276302Z nsight-compute-2024. | 443.1 MB | ####9 | 49% 2025-05-07T19:46:06.0276609Z 2025-05-07T19:46:06.0276614Z 2025-05-07T19:46:06.0276618Z 2025-05-07T19:46:06.0276621Z 2025-05-07T19:46:06.0276626Z 2025-05-07T19:46:06.0434635Z cuda-nvvp-12.6.80 | 109.3 MB | ####3 | 43%  2025-05-07T19:46:06.0435023Z 2025-05-07T19:46:06.0867801Z libcublas-12.6.4.1 | 256.2 MB | ######## | 81%  2025-05-07T19:46:06.1276516Z nsight-compute-2024. | 443.1 MB | #####1 | 51% 2025-05-07T19:46:06.1276829Z 2025-05-07T19:46:06.1276841Z 2025-05-07T19:46:06.1276845Z 2025-05-07T19:46:06.1276850Z 2025-05-07T19:46:06.1276853Z 2025-05-07T19:46:06.1437830Z cuda-nvvp-12.6.80 | 109.3 MB | ##### | 50%  2025-05-07T19:46:06.1438157Z 2025-05-07T19:46:06.1870699Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 84%  2025-05-07T19:46:06.2278312Z nsight-compute-2024. | 443.1 MB | #####2 | 53% 2025-05-07T19:46:06.2278646Z 2025-05-07T19:46:06.2278652Z 2025-05-07T19:46:06.2278655Z 2025-05-07T19:46:06.2278659Z 2025-05-07T19:46:06.2279473Z 2025-05-07T19:46:06.2440112Z cuda-nvvp-12.6.80 | 109.3 MB | #####7 | 57%  2025-05-07T19:46:06.2440433Z 2025-05-07T19:46:06.2916145Z libcublas-12.6.4.1 | 256.2 MB | ########6 | 86%  2025-05-07T19:46:06.3285611Z nsight-compute-2024. 
| 443.1 MB | #####4 | 54% 2025-05-07T19:46:06.3286277Z 2025-05-07T19:46:06.3286286Z 2025-05-07T19:46:06.3286293Z 2025-05-07T19:46:06.3286301Z 2025-05-07T19:46:06.3286340Z 2025-05-07T19:46:06.3440341Z cuda-nvvp-12.6.80 | 109.3 MB | ######4 | 64%  2025-05-07T19:46:06.3440655Z 2025-05-07T19:46:06.3916563Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 89%  2025-05-07T19:46:06.4331707Z nsight-compute-2024. | 443.1 MB | #####6 | 56% 2025-05-07T19:46:06.4332119Z 2025-05-07T19:46:06.4332181Z 2025-05-07T19:46:06.4332188Z 2025-05-07T19:46:06.4332216Z 2025-05-07T19:46:06.4332222Z 2025-05-07T19:46:06.4466396Z cuda-nvvp-12.6.80 | 109.3 MB | #######1 | 71%  2025-05-07T19:46:06.4466755Z 2025-05-07T19:46:06.5147958Z libcublas-12.6.4.1 | 256.2 MB | #########1 | 92%  2025-05-07T19:46:06.5331545Z nsight-compute-2024. | 443.1 MB | #####7 | 58% 2025-05-07T19:46:06.5332035Z 2025-05-07T19:46:06.5332062Z 2025-05-07T19:46:06.5332069Z 2025-05-07T19:46:06.5332100Z 2025-05-07T19:46:06.5332106Z 2025-05-07T19:46:06.5467234Z cuda-nvvp-12.6.80 | 109.3 MB | #######8 | 79%  2025-05-07T19:46:06.5467591Z 2025-05-07T19:46:06.6148139Z libcublas-12.6.4.1 | 256.2 MB | #########5 | 95%  2025-05-07T19:46:06.6334956Z nsight-compute-2024. | 443.1 MB | #####9 | 60% 2025-05-07T19:46:06.6335331Z 2025-05-07T19:46:06.6335412Z 2025-05-07T19:46:06.6335416Z 2025-05-07T19:46:06.6335421Z 2025-05-07T19:46:06.6335439Z 2025-05-07T19:46:06.6521307Z cuda-nvvp-12.6.80 | 109.3 MB | ########5 | 86%  2025-05-07T19:46:06.6521644Z 2025-05-07T19:46:06.7153811Z libcublas-12.6.4.1 | 256.2 MB | #########7 | 98%  2025-05-07T19:46:06.7340023Z nsight-compute-2024. | 443.1 MB | ######1 | 61% 2025-05-07T19:46:06.7340367Z 2025-05-07T19:46:06.7340373Z 2025-05-07T19:46:06.7340378Z 2025-05-07T19:46:06.7340382Z 2025-05-07T19:46:06.7340388Z 2025-05-07T19:46:06.8156016Z cuda-nvvp-12.6.80 | 109.3 MB | #########2 | 93%  2025-05-07T19:46:06.8920332Z nsight-compute-2024. 
| 443.1 MB | ######3 | 63% 2025-05-07T19:46:06.8920657Z 2025-05-07T19:46:06.8920663Z 2025-05-07T19:46:06.8920668Z 2025-05-07T19:46:06.8922631Z 2025-05-07T19:46:06.9435112Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:06.9435463Z 2025-05-07T19:46:06.9435469Z 2025-05-07T19:46:06.9435473Z 2025-05-07T19:46:06.9435484Z 2025-05-07T19:46:06.9435488Z 2025-05-07T19:46:06.9435493Z 2025-05-07T19:46:06.9959891Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:07.0436326Z nsight-compute-2024. | 443.1 MB | ######5 | 65% 2025-05-07T19:46:07.0436640Z 2025-05-07T19:46:07.0436649Z 2025-05-07T19:46:07.0436677Z 2025-05-07T19:46:07.0436682Z 2025-05-07T19:46:07.0436687Z 2025-05-07T19:46:07.0436690Z 2025-05-07T19:46:07.0958710Z libcusolver-11.7.1.2 | 95.8 MB | 9 | 10%  2025-05-07T19:46:07.1435957Z nsight-compute-2024. | 443.1 MB | ######7 | 67% 2025-05-07T19:46:07.1436280Z 2025-05-07T19:46:07.1436287Z 2025-05-07T19:46:07.1436292Z 2025-05-07T19:46:07.1436295Z 2025-05-07T19:46:07.1436303Z 2025-05-07T19:46:07.1436307Z 2025-05-07T19:46:07.1959861Z libcusolver-11.7.1.2 | 95.8 MB | #8 | 18%  2025-05-07T19:46:07.2436178Z nsight-compute-2024. | 443.1 MB | ######8 | 69% 2025-05-07T19:46:07.2436520Z 2025-05-07T19:46:07.2436525Z 2025-05-07T19:46:07.2436529Z 2025-05-07T19:46:07.2436533Z 2025-05-07T19:46:07.2436536Z 2025-05-07T19:46:07.2436541Z 2025-05-07T19:46:07.3065964Z libcusolver-11.7.1.2 | 95.8 MB | ##7 | 27%  2025-05-07T19:46:07.3579062Z nsight-compute-2024. | 443.1 MB | ####### | 71% 2025-05-07T19:46:07.3579377Z 2025-05-07T19:46:07.3579383Z 2025-05-07T19:46:07.3579387Z 2025-05-07T19:46:07.3579391Z 2025-05-07T19:46:07.3579396Z 2025-05-07T19:46:07.3579401Z 2025-05-07T19:46:07.4300510Z libcusolver-11.7.1.2 | 95.8 MB | ###4 | 35%  2025-05-07T19:46:07.4579770Z nsight-compute-2024. 
| 443.1 MB | #######2 | 72% 2025-05-07T19:46:07.4580056Z 2025-05-07T19:46:07.4580063Z 2025-05-07T19:46:07.4580066Z 2025-05-07T19:46:07.4580070Z 2025-05-07T19:46:07.4580074Z 2025-05-07T19:46:07.4580078Z 2025-05-07T19:46:07.4602101Z libcusolver-11.7.1.2 | 95.8 MB | ####3 | 44%  2025-05-07T19:46:07.4602442Z 2025-05-07T19:46:07.4604491Z 2025-05-07T19:46:07.5315390Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:07.5315730Z 2025-05-07T19:46:07.5315761Z 2025-05-07T19:46:07.5315786Z 2025-05-07T19:46:07.5315791Z 2025-05-07T19:46:07.5315796Z 2025-05-07T19:46:07.5315801Z 2025-05-07T19:46:07.5315806Z 2025-05-07T19:46:07.5582994Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:07.5583545Z nsight-compute-2024. | 443.1 MB | #######3 | 74% 2025-05-07T19:46:07.5583806Z 2025-05-07T19:46:07.5583811Z 2025-05-07T19:46:07.5583817Z 2025-05-07T19:46:07.5583843Z 2025-05-07T19:46:07.5583847Z 2025-05-07T19:46:07.5583850Z 2025-05-07T19:46:07.6316168Z libcusolver-11.7.1.2 | 95.8 MB | #####2 | 53%  2025-05-07T19:46:07.6316529Z 2025-05-07T19:46:07.6316534Z 2025-05-07T19:46:07.6316537Z 2025-05-07T19:46:07.6316541Z 2025-05-07T19:46:07.6316546Z 2025-05-07T19:46:07.6316551Z 2025-05-07T19:46:07.6316556Z 2025-05-07T19:46:07.6761295Z libnpp-12.3.1.54 | 93.4 MB | 6 | 7%  2025-05-07T19:46:07.6862620Z nsight-compute-2024. | 443.1 MB | #######5 | 75% 2025-05-07T19:46:07.6862966Z 2025-05-07T19:46:07.6862971Z 2025-05-07T19:46:07.6862975Z 2025-05-07T19:46:07.6862979Z 2025-05-07T19:46:07.6862983Z 2025-05-07T19:46:07.6863722Z 2025-05-07T19:46:07.7316078Z libcusolver-11.7.1.2 | 95.8 MB | ######1 | 61%  2025-05-07T19:46:07.7316430Z 2025-05-07T19:46:07.7316435Z 2025-05-07T19:46:07.7316438Z 2025-05-07T19:46:07.7316443Z 2025-05-07T19:46:07.7316446Z 2025-05-07T19:46:07.7316471Z 2025-05-07T19:46:07.7316475Z 2025-05-07T19:46:07.7763167Z libnpp-12.3.1.54 | 93.4 MB | #3 | 14%  2025-05-07T19:46:07.8155857Z nsight-compute-2024. 
| 443.1 MB | #######6 | 77% 2025-05-07T19:46:07.8156209Z 2025-05-07T19:46:07.8156214Z 2025-05-07T19:46:07.8156218Z 2025-05-07T19:46:07.8156221Z 2025-05-07T19:46:07.8156225Z 2025-05-07T19:46:07.8156229Z 2025-05-07T19:46:07.8318437Z libcusolver-11.7.1.2 | 95.8 MB | ######8 | 69%  2025-05-07T19:46:07.8319401Z 2025-05-07T19:46:07.8319415Z 2025-05-07T19:46:07.8319427Z 2025-05-07T19:46:07.8319471Z 2025-05-07T19:46:07.8319483Z 2025-05-07T19:46:07.8319494Z 2025-05-07T19:46:07.8319504Z 2025-05-07T19:46:07.8763095Z libnpp-12.3.1.54 | 93.4 MB | ## | 21%  2025-05-07T19:46:07.9316607Z nsight-compute-2024. | 443.1 MB | #######8 | 78% 2025-05-07T19:46:07.9316912Z 2025-05-07T19:46:07.9316918Z 2025-05-07T19:46:07.9316922Z 2025-05-07T19:46:07.9316926Z 2025-05-07T19:46:07.9316948Z 2025-05-07T19:46:07.9316951Z 2025-05-07T19:46:07.9316955Z 2025-05-07T19:46:07.9763465Z libnpp-12.3.1.54 | 93.4 MB | ##6 | 27%  2025-05-07T19:46:08.0071954Z nsight-compute-2024. | 443.1 MB | ######## | 81% 2025-05-07T19:46:08.0072382Z 2025-05-07T19:46:08.0072457Z 2025-05-07T19:46:08.0072461Z 2025-05-07T19:46:08.0072465Z 2025-05-07T19:46:08.0072468Z 2025-05-07T19:46:08.0072472Z 2025-05-07T19:46:08.0317844Z libcusolver-11.7.1.2 | 95.8 MB | #######5 | 76%  2025-05-07T19:46:08.0318207Z 2025-05-07T19:46:08.0318432Z 2025-05-07T19:46:08.0318437Z 2025-05-07T19:46:08.0318441Z 2025-05-07T19:46:08.0318445Z 2025-05-07T19:46:08.0318449Z 2025-05-07T19:46:08.0320701Z 2025-05-07T19:46:08.0330090Z libnpp-12.3.1.54 | 93.4 MB | ###3 | 34%  2025-05-07T19:46:08.0330406Z 2025-05-07T19:46:08.0330411Z 2025-05-07T19:46:08.0330415Z 2025-05-07T19:46:08.0330425Z 2025-05-07T19:46:08.0926090Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:08.1073219Z nsight-compute-2024. 
| 443.1 MB | ########2 | 82% 2025-05-07T19:46:08.1073615Z 2025-05-07T19:46:08.1073726Z 2025-05-07T19:46:08.1320908Z 2025-05-07T19:46:08.1320928Z 2025-05-07T19:46:08.1320944Z 2025-05-07T19:46:08.1320956Z 2025-05-07T19:46:08.1321735Z libcusolver-11.7.1.2 | 95.8 MB | ########2 | 82%  2025-05-07T19:46:08.1322067Z 2025-05-07T19:46:08.1322070Z 2025-05-07T19:46:08.1322074Z 2025-05-07T19:46:08.1322102Z 2025-05-07T19:46:08.1322106Z 2025-05-07T19:46:08.1322129Z 2025-05-07T19:46:08.1322133Z 2025-05-07T19:46:08.1361606Z libnpp-12.3.1.54 | 93.4 MB | ####1 | 42%  2025-05-07T19:46:08.1361923Z 2025-05-07T19:46:08.1361928Z 2025-05-07T19:46:08.1361932Z 2025-05-07T19:46:08.1361935Z 2025-05-07T19:46:08.1361939Z 2025-05-07T19:46:08.1362226Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:08.1362516Z 2025-05-07T19:46:08.1362534Z 2025-05-07T19:46:08.1362538Z 2025-05-07T19:46:08.1362541Z 2025-05-07T19:46:08.1362545Z 2025-05-07T19:46:08.1835866Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:08.1836193Z 2025-05-07T19:46:08.1836199Z 2025-05-07T19:46:08.1836202Z 2025-05-07T19:46:08.1836206Z 2025-05-07T19:46:08.1836209Z 2025-05-07T19:46:08.1836213Z 2025-05-07T19:46:08.1836216Z 2025-05-07T19:46:08.1836220Z 2025-05-07T19:46:08.2156764Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:08.2267348Z nsight-compute-2024. 
| 443.1 MB | ########3 | 84% 2025-05-07T19:46:08.2267701Z 2025-05-07T19:46:08.2267707Z 2025-05-07T19:46:08.2267711Z 2025-05-07T19:46:08.2267716Z 2025-05-07T19:46:08.2267720Z 2025-05-07T19:46:08.2267725Z 2025-05-07T19:46:08.2402105Z libcusolver-11.7.1.2 | 95.8 MB | ########8 | 88%  2025-05-07T19:46:08.2402485Z 2025-05-07T19:46:08.2402515Z 2025-05-07T19:46:08.2402520Z 2025-05-07T19:46:08.2402543Z 2025-05-07T19:46:08.2402548Z 2025-05-07T19:46:08.2402552Z 2025-05-07T19:46:08.2402556Z 2025-05-07T19:46:08.2835863Z libnpp-12.3.1.54 | 93.4 MB | ####8 | 49%  2025-05-07T19:46:08.2836187Z 2025-05-07T19:46:08.2836192Z 2025-05-07T19:46:08.2836196Z 2025-05-07T19:46:08.2836224Z 2025-05-07T19:46:08.2836229Z 2025-05-07T19:46:08.2836233Z 2025-05-07T19:46:08.2836237Z 2025-05-07T19:46:08.2836242Z 2025-05-07T19:46:08.3356907Z cuda-nvdisasm-12.6.7 | 47.6 MB | #1 | 11%  2025-05-07T19:46:08.3442086Z nsight-compute-2024. | 443.1 MB | ########5 | 85% 2025-05-07T19:46:08.3442375Z 2025-05-07T19:46:08.3442380Z 2025-05-07T19:46:08.3442384Z 2025-05-07T19:46:08.3442388Z 2025-05-07T19:46:08.3442391Z 2025-05-07T19:46:08.3442400Z 2025-05-07T19:46:08.3594064Z libcusolver-11.7.1.2 | 95.8 MB | #########4 | 94%  2025-05-07T19:46:08.3594517Z 2025-05-07T19:46:08.3594681Z 2025-05-07T19:46:08.3594686Z 2025-05-07T19:46:08.3594719Z 2025-05-07T19:46:08.3594733Z 2025-05-07T19:46:08.3594761Z 2025-05-07T19:46:08.3594852Z 2025-05-07T19:46:08.3836827Z libnpp-12.3.1.54 | 93.4 MB | #####5 | 56%  2025-05-07T19:46:08.3837156Z 2025-05-07T19:46:08.3837160Z 2025-05-07T19:46:08.3837164Z 2025-05-07T19:46:08.3837168Z 2025-05-07T19:46:08.3837172Z 2025-05-07T19:46:08.3837175Z 2025-05-07T19:46:08.3837179Z 2025-05-07T19:46:08.3837763Z 2025-05-07T19:46:08.4259031Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##2 | 23%  2025-05-07T19:46:08.4259387Z 2025-05-07T19:46:08.4259601Z 2025-05-07T19:46:08.4259607Z 2025-05-07T19:46:08.4604408Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:08.4604916Z nsight-compute-2024. 
| 443.1 MB | ########6 | 87% 2025-05-07T19:46:08.4605206Z 2025-05-07T19:46:08.4605211Z 2025-05-07T19:46:08.4605215Z 2025-05-07T19:46:08.4605219Z 2025-05-07T19:46:08.4605223Z 2025-05-07T19:46:08.4606365Z 2025-05-07T19:46:08.4738789Z libcusolver-11.7.1.2 | 95.8 MB | #########9 | 100%  2025-05-07T19:46:08.4739232Z 2025-05-07T19:46:08.4739430Z 2025-05-07T19:46:08.4739440Z 2025-05-07T19:46:08.4739446Z 2025-05-07T19:46:08.4739452Z 2025-05-07T19:46:08.4739457Z 2025-05-07T19:46:08.4739463Z 2025-05-07T19:46:08.4837633Z libnpp-12.3.1.54 | 93.4 MB | ######2 | 62%  2025-05-07T19:46:08.4837973Z 2025-05-07T19:46:08.4837982Z 2025-05-07T19:46:08.4837989Z 2025-05-07T19:46:08.4837998Z 2025-05-07T19:46:08.4838004Z 2025-05-07T19:46:08.4838008Z 2025-05-07T19:46:08.4838055Z 2025-05-07T19:46:08.4838076Z 2025-05-07T19:46:08.5733654Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###5 | 36%  2025-05-07T19:46:08.5835792Z nsight-compute-2024. | 443.1 MB | ########8 | 88% 2025-05-07T19:46:08.5836123Z 2025-05-07T19:46:08.5836128Z 2025-05-07T19:46:08.5836133Z 2025-05-07T19:46:08.5836137Z 2025-05-07T19:46:08.5836143Z 2025-05-07T19:46:08.5836147Z 2025-05-07T19:46:08.5836186Z 2025-05-07T19:46:08.5843582Z libnpp-12.3.1.54 | 93.4 MB | ######8 | 68%  2025-05-07T19:46:08.5843886Z 2025-05-07T19:46:08.5843901Z 2025-05-07T19:46:08.5843905Z 2025-05-07T19:46:08.5843909Z 2025-05-07T19:46:08.5843912Z 2025-05-07T19:46:08.5843916Z 2025-05-07T19:46:08.5843919Z 2025-05-07T19:46:08.5844719Z 2025-05-07T19:46:08.6750870Z cuda-nvdisasm-12.6.7 | 47.6 MB | #####4 | 54%  2025-05-07T19:46:08.6836091Z nsight-compute-2024. 
| 443.1 MB | ########9 | 90% 2025-05-07T19:46:08.6836400Z 2025-05-07T19:46:08.6836427Z 2025-05-07T19:46:08.6836434Z 2025-05-07T19:46:08.6836438Z 2025-05-07T19:46:08.6836441Z 2025-05-07T19:46:08.6836445Z 2025-05-07T19:46:08.6836448Z 2025-05-07T19:46:08.6846955Z libnpp-12.3.1.54 | 93.4 MB | #######6 | 77%  2025-05-07T19:46:08.6847817Z 2025-05-07T19:46:08.6847829Z 2025-05-07T19:46:08.6847841Z 2025-05-07T19:46:08.6847852Z 2025-05-07T19:46:08.6847862Z 2025-05-07T19:46:08.6847903Z 2025-05-07T19:46:08.6847915Z 2025-05-07T19:46:08.6847939Z 2025-05-07T19:46:08.7838270Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####### | 70%  2025-05-07T19:46:08.7838649Z 2025-05-07T19:46:08.7838654Z 2025-05-07T19:46:08.7838659Z 2025-05-07T19:46:08.7838663Z 2025-05-07T19:46:08.7838666Z 2025-05-07T19:46:08.7838670Z 2025-05-07T19:46:08.7838673Z 2025-05-07T19:46:08.7849164Z libnpp-12.3.1.54 | 93.4 MB | ########6 | 86%  2025-05-07T19:46:08.7849515Z 2025-05-07T19:46:08.7849531Z 2025-05-07T19:46:08.7849535Z 2025-05-07T19:46:08.7849562Z 2025-05-07T19:46:08.7849566Z 2025-05-07T19:46:08.7849569Z 2025-05-07T19:46:08.7849573Z 2025-05-07T19:46:08.7850034Z 2025-05-07T19:46:08.7858476Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########8 | 88%  2025-05-07T19:46:08.8839478Z nsight-compute-2024. | 443.1 MB | ######### | 91% 2025-05-07T19:46:08.8839820Z 2025-05-07T19:46:08.8839827Z 2025-05-07T19:46:08.8839833Z 2025-05-07T19:46:08.8839879Z 2025-05-07T19:46:08.8839898Z 2025-05-07T19:46:08.8839901Z 2025-05-07T19:46:08.8839905Z 2025-05-07T19:46:08.8865155Z libnpp-12.3.1.54 | 93.4 MB | #########5 | 96%  2025-05-07T19:46:08.9866032Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:46:09.0892986Z nsight-compute-2024. | 443.1 MB | #########4 | 95% 2025-05-07T19:46:09.2144188Z nsight-compute-2024. | 443.1 MB | #########7 | 97% 2025-05-07T19:46:09.3524682Z nsight-compute-2024. 
| 443.1 MB | #########9 | 99% 2025-05-07T19:46:09.3525027Z 2025-05-07T19:46:09.3525284Z 2025-05-07T19:46:09.3525292Z 2025-05-07T19:46:09.3525297Z 2025-05-07T19:46:09.3525302Z 2025-05-07T19:46:09.3525307Z 2025-05-07T19:46:09.3525311Z 2025-05-07T19:46:09.3525319Z 2025-05-07T19:46:09.3941997Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:09.3942357Z 2025-05-07T19:46:09.3942363Z 2025-05-07T19:46:09.3942368Z 2025-05-07T19:46:09.3942373Z 2025-05-07T19:46:09.3942569Z 2025-05-07T19:46:09.3942575Z 2025-05-07T19:46:09.3942580Z 2025-05-07T19:46:09.3942584Z 2025-05-07T19:46:09.3942589Z 2025-05-07T19:46:09.4943579Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:09.4943956Z 2025-05-07T19:46:09.4943961Z 2025-05-07T19:46:09.4943965Z 2025-05-07T19:46:09.4943968Z 2025-05-07T19:46:09.4943972Z 2025-05-07T19:46:09.4943975Z 2025-05-07T19:46:09.4943979Z 2025-05-07T19:46:09.4943983Z 2025-05-07T19:46:09.4943987Z 2025-05-07T19:46:09.5918864Z libcurand-10.3.7.77 | 39.9 MB | ##3 | 23%  2025-05-07T19:46:09.5919214Z 2025-05-07T19:46:09.5919220Z 2025-05-07T19:46:09.5919225Z 2025-05-07T19:46:09.5919229Z 2025-05-07T19:46:09.5919232Z 2025-05-07T19:46:09.5919237Z 2025-05-07T19:46:09.5945676Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:09.5946289Z 2025-05-07T19:46:09.5946321Z 2025-05-07T19:46:09.5946327Z 2025-05-07T19:46:09.5946353Z 2025-05-07T19:46:09.5946358Z 2025-05-07T19:46:09.5946362Z 2025-05-07T19:46:09.5946367Z 2025-05-07T19:46:09.5946372Z 2025-05-07T19:46:09.5946376Z 2025-05-07T19:46:09.6443205Z libcurand-10.3.7.77 | 39.9 MB | ####8 | 49%  2025-05-07T19:46:09.6443549Z 2025-05-07T19:46:09.6443554Z 2025-05-07T19:46:09.6443558Z 2025-05-07T19:46:09.6443562Z 2025-05-07T19:46:09.6443565Z 2025-05-07T19:46:09.6443569Z 2025-05-07T19:46:09.6443572Z 2025-05-07T19:46:09.6443576Z 2025-05-07T19:46:09.6443580Z 2025-05-07T19:46:09.6443888Z 2025-05-07T19:46:09.6948005Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:09.6948965Z 
2025-05-07T19:46:09.6948981Z 2025-05-07T19:46:09.6948992Z 2025-05-07T19:46:09.6949002Z 2025-05-07T19:46:09.6949012Z 2025-05-07T19:46:09.6949023Z 2025-05-07T19:46:09.6949033Z 2025-05-07T19:46:09.6949044Z 2025-05-07T19:46:09.6949080Z 2025-05-07T19:46:09.7446187Z libcurand-10.3.7.77 | 39.9 MB | #######1 | 71%  2025-05-07T19:46:09.7446596Z 2025-05-07T19:46:09.7446602Z 2025-05-07T19:46:09.7446608Z 2025-05-07T19:46:09.7446613Z 2025-05-07T19:46:09.7446617Z 2025-05-07T19:46:09.7446621Z 2025-05-07T19:46:09.7446626Z 2025-05-07T19:46:09.7446658Z 2025-05-07T19:46:09.7446661Z 2025-05-07T19:46:09.7446665Z 2025-05-07T19:46:09.8036908Z gds-tools-1.11.1.6 | 37.8 MB | #8 | 19%  2025-05-07T19:46:09.8037257Z 2025-05-07T19:46:09.8037263Z 2025-05-07T19:46:09.8037269Z 2025-05-07T19:46:09.8037272Z 2025-05-07T19:46:09.8037323Z 2025-05-07T19:46:09.8037327Z 2025-05-07T19:46:09.8037331Z 2025-05-07T19:46:09.8037334Z 2025-05-07T19:46:09.8037338Z 2025-05-07T19:46:09.8446367Z libcurand-10.3.7.77 | 39.9 MB | #########1 | 92%  2025-05-07T19:46:09.8446752Z 2025-05-07T19:46:09.8446758Z 2025-05-07T19:46:09.8446787Z 2025-05-07T19:46:09.8446792Z 2025-05-07T19:46:09.8446797Z 2025-05-07T19:46:09.8446802Z 2025-05-07T19:46:09.8446825Z 2025-05-07T19:46:09.8446831Z 2025-05-07T19:46:09.8446837Z 2025-05-07T19:46:09.8446842Z 2025-05-07T19:46:09.8732254Z gds-tools-1.11.1.6 | 37.8 MB | ###5 | 35%  2025-05-07T19:46:09.8732717Z 2025-05-07T19:46:09.8732808Z 2025-05-07T19:46:09.8732813Z 2025-05-07T19:46:09.8732907Z 2025-05-07T19:46:09.8732915Z 2025-05-07T19:46:09.8732921Z 2025-05-07T19:46:09.8732927Z 2025-05-07T19:46:09.8732932Z 2025-05-07T19:46:09.9503383Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:09.9504113Z 2025-05-07T19:46:09.9504124Z 2025-05-07T19:46:09.9504130Z 2025-05-07T19:46:09.9504133Z 2025-05-07T19:46:09.9504138Z 2025-05-07T19:46:09.9504143Z 2025-05-07T19:46:09.9504150Z 2025-05-07T19:46:09.9504155Z 2025-05-07T19:46:09.9504160Z 2025-05-07T19:46:09.9504165Z 
2025-05-07T19:46:10.0237289Z gds-tools-1.11.1.6 | 37.8 MB | ####9 | 50%  2025-05-07T19:46:10.0237654Z 2025-05-07T19:46:10.0237987Z 2025-05-07T19:46:10.0237992Z 2025-05-07T19:46:10.0237997Z 2025-05-07T19:46:10.0238000Z 2025-05-07T19:46:10.0238005Z 2025-05-07T19:46:10.0238010Z 2025-05-07T19:46:10.0758370Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:10.0758705Z 2025-05-07T19:46:10.0758709Z 2025-05-07T19:46:10.0758737Z 2025-05-07T19:46:10.0758741Z 2025-05-07T19:46:10.0758745Z 2025-05-07T19:46:10.0758749Z 2025-05-07T19:46:10.0758753Z 2025-05-07T19:46:10.0758759Z 2025-05-07T19:46:10.0758762Z 2025-05-07T19:46:10.0758766Z 2025-05-07T19:46:10.0759598Z 2025-05-07T19:46:10.1019119Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:10.1019476Z 2025-05-07T19:46:10.1019539Z 2025-05-07T19:46:10.1019543Z 2025-05-07T19:46:10.1019548Z 2025-05-07T19:46:10.1019553Z 2025-05-07T19:46:10.1019571Z 2025-05-07T19:46:10.1019575Z 2025-05-07T19:46:10.1019580Z 2025-05-07T19:46:10.1019585Z 2025-05-07T19:46:10.1019589Z 2025-05-07T19:46:10.1021263Z gds-tools-1.11.1.6 | 37.8 MB | ######3 | 64%  2025-05-07T19:46:10.1022085Z 2025-05-07T19:46:10.1474663Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:10.1475009Z 2025-05-07T19:46:10.1475347Z 2025-05-07T19:46:10.1475364Z 2025-05-07T19:46:10.1475370Z 2025-05-07T19:46:10.1475377Z 2025-05-07T19:46:10.1475382Z 2025-05-07T19:46:10.1475387Z 2025-05-07T19:46:10.1475393Z 2025-05-07T19:46:10.1475439Z 2025-05-07T19:46:10.1475445Z 2025-05-07T19:46:10.1475449Z 2025-05-07T19:46:10.1475454Z 2025-05-07T19:46:10.1760512Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:10.1761524Z 2025-05-07T19:46:10.1761538Z 2025-05-07T19:46:10.1761550Z 2025-05-07T19:46:10.1761561Z 2025-05-07T19:46:10.1761589Z 2025-05-07T19:46:10.1761600Z 2025-05-07T19:46:10.1761610Z 2025-05-07T19:46:10.1761621Z 2025-05-07T19:46:10.1761631Z 2025-05-07T19:46:10.1761642Z 2025-05-07T19:46:10.1761676Z 2025-05-07T19:46:10.2457133Z cuda-nvcc-tools-12.6 | 
23.0 MB | ###8 | 39%  2025-05-07T19:46:10.2457552Z 2025-05-07T19:46:10.2457558Z 2025-05-07T19:46:10.2457562Z 2025-05-07T19:46:10.2457569Z 2025-05-07T19:46:10.2457573Z 2025-05-07T19:46:10.2457577Z 2025-05-07T19:46:10.2457582Z 2025-05-07T19:46:10.2457586Z 2025-05-07T19:46:10.2457590Z 2025-05-07T19:46:10.2457596Z 2025-05-07T19:46:10.2476031Z gds-tools-1.11.1.6 | 37.8 MB | #######5 | 76%  2025-05-07T19:46:10.2476367Z 2025-05-07T19:46:10.2476373Z 2025-05-07T19:46:10.2476410Z 2025-05-07T19:46:10.2476414Z 2025-05-07T19:46:10.2476418Z 2025-05-07T19:46:10.2476422Z 2025-05-07T19:46:10.2476426Z 2025-05-07T19:46:10.2476430Z 2025-05-07T19:46:10.2476434Z 2025-05-07T19:46:10.2476439Z 2025-05-07T19:46:10.2476443Z 2025-05-07T19:46:10.2476447Z 2025-05-07T19:46:10.2760157Z cuda-nvrtc-12.6.85 | 17.3 MB | ###1 | 32%  2025-05-07T19:46:10.2760549Z 2025-05-07T19:46:10.2760554Z 2025-05-07T19:46:10.2760558Z 2025-05-07T19:46:10.2760562Z 2025-05-07T19:46:10.2760566Z 2025-05-07T19:46:10.2760570Z 2025-05-07T19:46:10.2760573Z 2025-05-07T19:46:10.2760577Z 2025-05-07T19:46:10.2760581Z 2025-05-07T19:46:10.2760585Z 2025-05-07T19:46:10.2760588Z 2025-05-07T19:46:10.3047269Z cuda-nvcc-tools-12.6 | 23.0 MB | ######9 | 70%  2025-05-07T19:46:10.3048149Z 2025-05-07T19:46:10.3048283Z 2025-05-07T19:46:10.3461493Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:10.3462075Z 2025-05-07T19:46:10.3462098Z 2025-05-07T19:46:10.3462103Z 2025-05-07T19:46:10.3462107Z 2025-05-07T19:46:10.3462112Z 2025-05-07T19:46:10.3462117Z 2025-05-07T19:46:10.3462121Z 2025-05-07T19:46:10.3462126Z 2025-05-07T19:46:10.3462131Z 2025-05-07T19:46:10.3462135Z 2025-05-07T19:46:10.3475916Z gds-tools-1.11.1.6 | 37.8 MB | ########7 | 87%  2025-05-07T19:46:10.3476261Z 2025-05-07T19:46:10.3476431Z 2025-05-07T19:46:10.3476436Z 2025-05-07T19:46:10.3476440Z 2025-05-07T19:46:10.3476443Z 2025-05-07T19:46:10.3476447Z 2025-05-07T19:46:10.3476452Z 2025-05-07T19:46:10.3476457Z 2025-05-07T19:46:10.3476460Z 2025-05-07T19:46:10.3476464Z 
2025-05-07T19:46:10.3476467Z 2025-05-07T19:46:10.3476471Z 2025-05-07T19:46:10.3660265Z cuda-nvrtc-12.6.85 | 17.3 MB | ######3 | 63%  2025-05-07T19:46:10.3660648Z 2025-05-07T19:46:10.3660653Z 2025-05-07T19:46:10.3660659Z 2025-05-07T19:46:10.3660663Z 2025-05-07T19:46:10.3660708Z 2025-05-07T19:46:10.3660713Z 2025-05-07T19:46:10.3660718Z 2025-05-07T19:46:10.3660723Z 2025-05-07T19:46:10.3660728Z 2025-05-07T19:46:10.3773020Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:10.3773574Z 2025-05-07T19:46:10.3775811Z 2025-05-07T19:46:10.3775823Z 2025-05-07T19:46:10.3775830Z 2025-05-07T19:46:10.3775849Z 2025-05-07T19:46:10.3775854Z 2025-05-07T19:46:10.3775891Z 2025-05-07T19:46:10.3775894Z 2025-05-07T19:46:10.3775898Z 2025-05-07T19:46:10.3775901Z 2025-05-07T19:46:10.3775905Z 2025-05-07T19:46:10.4027462Z cuda-nvcc-tools-12.6 | 23.0 MB | #########6 | 96%  2025-05-07T19:46:10.4027835Z 2025-05-07T19:46:10.4027842Z 2025-05-07T19:46:10.4027847Z 2025-05-07T19:46:10.4027853Z 2025-05-07T19:46:10.4027858Z 2025-05-07T19:46:10.4027862Z 2025-05-07T19:46:10.4027867Z 2025-05-07T19:46:10.4027894Z 2025-05-07T19:46:10.4027899Z 2025-05-07T19:46:10.4027904Z 2025-05-07T19:46:10.4027908Z 2025-05-07T19:46:10.4027932Z 2025-05-07T19:46:10.4027937Z 2025-05-07T19:46:10.4175211Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:10.4175574Z 2025-05-07T19:46:10.4175579Z 2025-05-07T19:46:10.4175610Z 2025-05-07T19:46:10.4175617Z 2025-05-07T19:46:10.4175621Z 2025-05-07T19:46:10.4548496Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:10.4548878Z 2025-05-07T19:46:10.4548883Z 2025-05-07T19:46:10.4548887Z 2025-05-07T19:46:10.4548891Z 2025-05-07T19:46:10.4548896Z 2025-05-07T19:46:10.4548899Z 2025-05-07T19:46:10.4548903Z 2025-05-07T19:46:10.4548906Z 2025-05-07T19:46:10.4548910Z 2025-05-07T19:46:10.4548914Z 2025-05-07T19:46:10.5038111Z gds-tools-1.11.1.6 | 37.8 MB | #########8 | 98%  2025-05-07T19:46:10.5038499Z 2025-05-07T19:46:10.5038506Z 2025-05-07T19:46:10.5038512Z 
2025-05-07T19:46:10.5038516Z 2025-05-07T19:46:10.5038521Z 2025-05-07T19:46:10.5038525Z 2025-05-07T19:46:10.5038586Z 2025-05-07T19:46:10.5038592Z 2025-05-07T19:46:10.5038597Z 2025-05-07T19:46:10.5038602Z 2025-05-07T19:46:10.5038607Z 2025-05-07T19:46:10.5038612Z 2025-05-07T19:46:10.5038617Z 2025-05-07T19:46:10.6276757Z libnvjitlink-12.6.85 | 14.9 MB | #####7 | 57%  2025-05-07T19:46:10.6277142Z 2025-05-07T19:46:10.6277149Z 2025-05-07T19:46:10.6277154Z 2025-05-07T19:46:10.6277251Z 2025-05-07T19:46:10.6277256Z 2025-05-07T19:46:10.6277260Z 2025-05-07T19:46:10.6277263Z 2025-05-07T19:46:10.6277267Z 2025-05-07T19:46:10.6277272Z 2025-05-07T19:46:10.6277277Z 2025-05-07T19:46:10.6277285Z 2025-05-07T19:46:10.6277290Z 2025-05-07T19:46:10.6277606Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:10.6277935Z 2025-05-07T19:46:10.6277939Z 2025-05-07T19:46:10.6277943Z 2025-05-07T19:46:10.6277947Z 2025-05-07T19:46:10.6277951Z 2025-05-07T19:46:10.6277955Z 2025-05-07T19:46:10.6277961Z 2025-05-07T19:46:10.6278215Z 2025-05-07T19:46:10.6278220Z 2025-05-07T19:46:10.6278224Z 2025-05-07T19:46:10.6278227Z 2025-05-07T19:46:10.6278231Z 2025-05-07T19:46:10.6739122Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:10.6739522Z 2025-05-07T19:46:10.6739528Z 2025-05-07T19:46:10.6739533Z 2025-05-07T19:46:10.6739537Z 2025-05-07T19:46:10.6739542Z 2025-05-07T19:46:10.6739819Z 2025-05-07T19:46:10.6739824Z 2025-05-07T19:46:10.6739828Z 2025-05-07T19:46:10.6739833Z 2025-05-07T19:46:10.6739838Z 2025-05-07T19:46:10.6739844Z 2025-05-07T19:46:10.6760387Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:10.6760761Z 2025-05-07T19:46:10.6760766Z 2025-05-07T19:46:10.6760770Z 2025-05-07T19:46:10.6760774Z 2025-05-07T19:46:10.6760778Z 2025-05-07T19:46:10.6760782Z 2025-05-07T19:46:10.6760786Z 2025-05-07T19:46:10.6760790Z 2025-05-07T19:46:10.6760795Z 2025-05-07T19:46:10.6760798Z 2025-05-07T19:46:10.6760818Z 2025-05-07T19:46:10.6760822Z 2025-05-07T19:46:10.6760825Z 
2025-05-07T19:46:10.6761267Z 2025-05-07T19:46:10.7215918Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:10.7216313Z 2025-05-07T19:46:10.7216318Z 2025-05-07T19:46:10.7216322Z 2025-05-07T19:46:10.7216327Z 2025-05-07T19:46:10.7216332Z 2025-05-07T19:46:10.7216336Z 2025-05-07T19:46:10.7216341Z 2025-05-07T19:46:10.7216390Z 2025-05-07T19:46:10.7216395Z 2025-05-07T19:46:10.7216399Z 2025-05-07T19:46:10.7216403Z 2025-05-07T19:46:10.7216407Z 2025-05-07T19:46:10.7216411Z 2025-05-07T19:46:10.7216415Z 2025-05-07T19:46:10.7216419Z 2025-05-07T19:46:10.7442923Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:10.7443334Z 2025-05-07T19:46:10.7443339Z 2025-05-07T19:46:10.7443343Z 2025-05-07T19:46:10.7443347Z 2025-05-07T19:46:10.7443351Z 2025-05-07T19:46:10.7443356Z 2025-05-07T19:46:10.7443360Z 2025-05-07T19:46:10.7443387Z 2025-05-07T19:46:10.7443392Z 2025-05-07T19:46:10.7443396Z 2025-05-07T19:46:10.7443399Z 2025-05-07T19:46:10.7443403Z 2025-05-07T19:46:10.7443407Z 2025-05-07T19:46:10.7443720Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:10.7444058Z 2025-05-07T19:46:10.7444061Z 2025-05-07T19:46:10.7444065Z 2025-05-07T19:46:10.7444068Z 2025-05-07T19:46:10.7444085Z 2025-05-07T19:46:10.7444089Z 2025-05-07T19:46:10.7444093Z 2025-05-07T19:46:10.7444096Z 2025-05-07T19:46:10.7444099Z 2025-05-07T19:46:10.7444103Z 2025-05-07T19:46:10.7444106Z 2025-05-07T19:46:10.7444110Z 2025-05-07T19:46:10.7444113Z 2025-05-07T19:46:10.7772495Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:10.7772898Z 2025-05-07T19:46:10.7772903Z 2025-05-07T19:46:10.7772908Z 2025-05-07T19:46:10.7772912Z 2025-05-07T19:46:10.7772916Z 2025-05-07T19:46:10.7772920Z 2025-05-07T19:46:10.7772924Z 2025-05-07T19:46:10.7772963Z 2025-05-07T19:46:10.7772967Z 2025-05-07T19:46:10.7772971Z 2025-05-07T19:46:10.7772976Z 2025-05-07T19:46:10.7772980Z 2025-05-07T19:46:10.7772984Z 2025-05-07T19:46:10.7773002Z 2025-05-07T19:46:10.7923378Z cuda-nvcc-dev_linux- | 10.8 MB | #######2 | 73%  
2025-05-07T19:46:10.7923754Z 2025-05-07T19:46:10.7923760Z 2025-05-07T19:46:10.7923764Z 2025-05-07T19:46:10.7923795Z 2025-05-07T19:46:10.7923798Z 2025-05-07T19:46:10.7923803Z 2025-05-07T19:46:10.7923807Z 2025-05-07T19:46:10.7923825Z 2025-05-07T19:46:10.7923829Z 2025-05-07T19:46:10.7923832Z 2025-05-07T19:46:10.7923836Z 2025-05-07T19:46:10.7923839Z 2025-05-07T19:46:10.7923842Z 2025-05-07T19:46:10.7923846Z 2025-05-07T19:46:10.7923850Z 2025-05-07T19:46:10.7923853Z 2025-05-07T19:46:10.8215882Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:10.8216287Z 2025-05-07T19:46:10.8216293Z 2025-05-07T19:46:10.8216297Z 2025-05-07T19:46:10.8216536Z 2025-05-07T19:46:10.8216541Z 2025-05-07T19:46:10.8216545Z 2025-05-07T19:46:10.8216549Z 2025-05-07T19:46:10.8216553Z 2025-05-07T19:46:10.8216557Z 2025-05-07T19:46:10.8216561Z 2025-05-07T19:46:10.8216565Z 2025-05-07T19:46:10.8216569Z 2025-05-07T19:46:10.8216573Z 2025-05-07T19:46:10.8216577Z 2025-05-07T19:46:10.8216582Z 2025-05-07T19:46:10.8806057Z cuda-nvvm-tools-12.6 | 10.4 MB | ######2 | 62%  2025-05-07T19:46:10.8806655Z 2025-05-07T19:46:10.8806661Z 2025-05-07T19:46:10.8806665Z 2025-05-07T19:46:10.8806668Z 2025-05-07T19:46:10.8806672Z 2025-05-07T19:46:10.8806675Z 2025-05-07T19:46:10.8806679Z 2025-05-07T19:46:10.8806683Z 2025-05-07T19:46:10.8806686Z 2025-05-07T19:46:10.8806690Z 2025-05-07T19:46:10.8924313Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:10.8924657Z 2025-05-07T19:46:10.8924662Z 2025-05-07T19:46:10.8924666Z 2025-05-07T19:46:10.8924669Z 2025-05-07T19:46:10.8924692Z 2025-05-07T19:46:10.8924696Z 2025-05-07T19:46:10.8924700Z 2025-05-07T19:46:10.8924703Z 2025-05-07T19:46:10.8924707Z 2025-05-07T19:46:10.8924711Z 2025-05-07T19:46:10.8924727Z 2025-05-07T19:46:10.8924731Z 2025-05-07T19:46:10.8924734Z 2025-05-07T19:46:10.8924738Z 2025-05-07T19:46:10.8924741Z 2025-05-07T19:46:10.8924745Z 2025-05-07T19:46:10.9354411Z cuda-sanitizer-api-1 | 8.9 MB | ########2 | 83%  2025-05-07T19:46:10.9354858Z 
2025-05-07T19:46:10.9354879Z 2025-05-07T19:46:10.9354883Z 2025-05-07T19:46:10.9354887Z 2025-05-07T19:46:10.9354891Z 2025-05-07T19:46:10.9354896Z 2025-05-07T19:46:10.9354900Z 2025-05-07T19:46:10.9354904Z 2025-05-07T19:46:10.9354908Z 2025-05-07T19:46:10.9354912Z 2025-05-07T19:46:10.9354916Z 2025-05-07T19:46:10.9354923Z 2025-05-07T19:46:10.9354927Z 2025-05-07T19:46:10.9354930Z 2025-05-07T19:46:10.9354934Z 2025-05-07T19:46:10.9354937Z 2025-05-07T19:46:10.9354940Z 2025-05-07T19:46:10.9777322Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:46:10.9777752Z 2025-05-07T19:46:10.9777757Z 2025-05-07T19:46:10.9777762Z 2025-05-07T19:46:10.9777766Z 2025-05-07T19:46:10.9777770Z 2025-05-07T19:46:10.9777774Z 2025-05-07T19:46:10.9777778Z 2025-05-07T19:46:10.9777782Z 2025-05-07T19:46:10.9777786Z 2025-05-07T19:46:10.9777789Z 2025-05-07T19:46:10.9777806Z 2025-05-07T19:46:10.9777810Z 2025-05-07T19:46:10.9777813Z 2025-05-07T19:46:10.9777817Z 2025-05-07T19:46:10.9857962Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:10.9858339Z 2025-05-07T19:46:10.9858344Z 2025-05-07T19:46:10.9858347Z 2025-05-07T19:46:10.9858351Z 2025-05-07T19:46:10.9858354Z 2025-05-07T19:46:10.9858358Z 2025-05-07T19:46:10.9858361Z 2025-05-07T19:46:10.9858365Z 2025-05-07T19:46:10.9858386Z 2025-05-07T19:46:10.9858390Z 2025-05-07T19:46:10.9858394Z 2025-05-07T19:46:10.9858398Z 2025-05-07T19:46:10.9858416Z 2025-05-07T19:46:10.9858421Z 2025-05-07T19:46:10.9858425Z 2025-05-07T19:46:10.9858428Z 2025-05-07T19:46:10.9876466Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:10.9876873Z 2025-05-07T19:46:10.9876878Z 2025-05-07T19:46:10.9876881Z 2025-05-07T19:46:10.9876885Z 2025-05-07T19:46:10.9876889Z 2025-05-07T19:46:10.9876892Z 2025-05-07T19:46:10.9876910Z 2025-05-07T19:46:10.9876914Z 2025-05-07T19:46:10.9876917Z 2025-05-07T19:46:10.9876921Z 2025-05-07T19:46:10.9876925Z 2025-05-07T19:46:10.9876928Z 2025-05-07T19:46:10.9876932Z 2025-05-07T19:46:10.9876935Z 2025-05-07T19:46:10.9876939Z 
2025-05-07T19:46:10.9877264Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:10.9877614Z 2025-05-07T19:46:10.9877618Z 2025-05-07T19:46:10.9877621Z 2025-05-07T19:46:10.9877624Z 2025-05-07T19:46:10.9877628Z 2025-05-07T19:46:10.9877631Z 2025-05-07T19:46:10.9877847Z 2025-05-07T19:46:10.9877853Z 2025-05-07T19:46:10.9877856Z 2025-05-07T19:46:10.9877860Z 2025-05-07T19:46:10.9877863Z 2025-05-07T19:46:10.9877867Z 2025-05-07T19:46:10.9877870Z 2025-05-07T19:46:10.9877873Z 2025-05-07T19:46:10.9877877Z 2025-05-07T19:46:11.0104595Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:11.0105004Z 2025-05-07T19:46:11.0105009Z 2025-05-07T19:46:11.0105275Z 2025-05-07T19:46:11.0105279Z 2025-05-07T19:46:11.0105282Z 2025-05-07T19:46:11.0105286Z 2025-05-07T19:46:11.0105289Z 2025-05-07T19:46:11.0105294Z 2025-05-07T19:46:11.0105314Z 2025-05-07T19:46:11.0105318Z 2025-05-07T19:46:11.0105322Z 2025-05-07T19:46:11.0105327Z 2025-05-07T19:46:11.0105331Z 2025-05-07T19:46:11.0105335Z 2025-05-07T19:46:11.0105339Z 2025-05-07T19:46:11.0105343Z 2025-05-07T19:46:11.0105347Z 2025-05-07T19:46:11.0105350Z 2025-05-07T19:46:11.0214552Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:11.0214959Z 2025-05-07T19:46:11.0214963Z 2025-05-07T19:46:11.0214967Z 2025-05-07T19:46:11.0214970Z 2025-05-07T19:46:11.0214974Z 2025-05-07T19:46:11.0214978Z 2025-05-07T19:46:11.0214981Z 2025-05-07T19:46:11.0214985Z 2025-05-07T19:46:11.0214988Z 2025-05-07T19:46:11.0214992Z 2025-05-07T19:46:11.0214995Z 2025-05-07T19:46:11.0214999Z 2025-05-07T19:46:11.0215002Z 2025-05-07T19:46:11.0215013Z 2025-05-07T19:46:11.0215017Z 2025-05-07T19:46:11.0215020Z 2025-05-07T19:46:11.0215023Z 2025-05-07T19:46:11.0215027Z 2025-05-07T19:46:11.0215030Z 2025-05-07T19:46:11.0356110Z ... (more hidden) ... 
2025-05-07T19:46:11.0356446Z 2025-05-07T19:46:11.0356451Z 2025-05-07T19:46:11.0356455Z 2025-05-07T19:46:11.0356458Z 2025-05-07T19:46:11.0356462Z 2025-05-07T19:46:11.0356466Z 2025-05-07T19:46:11.0356469Z 2025-05-07T19:46:11.0356472Z 2025-05-07T19:46:11.0356489Z 2025-05-07T19:46:11.0356492Z 2025-05-07T19:46:11.0356511Z 2025-05-07T19:46:11.0356516Z 2025-05-07T19:46:11.0356519Z 2025-05-07T19:46:11.0356523Z 2025-05-07T19:46:11.0356526Z 2025-05-07T19:46:11.0356529Z 2025-05-07T19:46:11.0356533Z 2025-05-07T19:46:11.0935696Z cuda-nvvm-impl-12.6. | 7.7 MB | ####8 | 48%  2025-05-07T19:46:11.0936099Z 2025-05-07T19:46:11.0936105Z 2025-05-07T19:46:11.0936109Z 2025-05-07T19:46:11.0936138Z 2025-05-07T19:46:11.0936142Z 2025-05-07T19:46:11.0936146Z 2025-05-07T19:46:11.0936151Z 2025-05-07T19:46:11.0936155Z 2025-05-07T19:46:11.0936159Z 2025-05-07T19:46:11.0936163Z 2025-05-07T19:46:11.0936167Z 2025-05-07T19:46:11.0936170Z 2025-05-07T19:46:11.0936175Z 2025-05-07T19:46:11.0936178Z 2025-05-07T19:46:11.0936196Z 2025-05-07T19:46:11.0936200Z 2025-05-07T19:46:11.0936203Z 2025-05-07T19:46:11.0936207Z 2025-05-07T19:46:11.0936210Z 2025-05-07T19:46:11.0984542Z ... (more hidden) ... 2025-05-07T19:46:11.0984933Z 2025-05-07T19:46:11.0984938Z 2025-05-07T19:46:11.0984956Z 2025-05-07T19:46:11.0984960Z 2025-05-07T19:46:11.0984963Z 2025-05-07T19:46:11.0984967Z 2025-05-07T19:46:11.0984970Z 2025-05-07T19:46:11.0984974Z 2025-05-07T19:46:11.0984977Z 2025-05-07T19:46:11.0984981Z 2025-05-07T19:46:11.0984984Z 2025-05-07T19:46:11.0984988Z 2025-05-07T19:46:11.0984991Z 2025-05-07T19:46:11.0984995Z 2025-05-07T19:46:11.0984998Z 2025-05-07T19:46:11.0985008Z 2025-05-07T19:46:11.0985012Z 2025-05-07T19:46:11.1575802Z 2025-05-07T19:46:11.1576649Z cuda-cupti-dev-12.6. 
| 3.4 MB | ########## | 100%  2025-05-07T19:46:11.1577088Z 2025-05-07T19:46:11.1577093Z 2025-05-07T19:46:11.1577098Z 2025-05-07T19:46:11.1577103Z 2025-05-07T19:46:11.1577107Z 2025-05-07T19:46:11.1577111Z 2025-05-07T19:46:11.1577115Z 2025-05-07T19:46:11.1577120Z 2025-05-07T19:46:11.1577124Z 2025-05-07T19:46:11.1577128Z 2025-05-07T19:46:11.1577132Z 2025-05-07T19:46:11.1577136Z 2025-05-07T19:46:11.1577409Z 2025-05-07T19:46:11.1577418Z 2025-05-07T19:46:11.1577422Z 2025-05-07T19:46:11.1577426Z 2025-05-07T19:46:11.1577430Z 2025-05-07T19:46:11.1577807Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:11.1578146Z 2025-05-07T19:46:11.1578150Z 2025-05-07T19:46:11.1578154Z 2025-05-07T19:46:11.1578158Z 2025-05-07T19:46:11.1578162Z 2025-05-07T19:46:11.1578296Z 2025-05-07T19:46:11.1578300Z 2025-05-07T19:46:11.1578303Z 2025-05-07T19:46:11.1578306Z 2025-05-07T19:46:11.1578310Z 2025-05-07T19:46:11.1578313Z 2025-05-07T19:46:11.1578317Z 2025-05-07T19:46:11.1578320Z 2025-05-07T19:46:11.1578323Z 2025-05-07T19:46:11.1578327Z 2025-05-07T19:46:11.1578330Z 2025-05-07T19:46:11.1578334Z 2025-05-07T19:46:11.6303402Z cuda-nvvm-impl-12.6. 
| 7.7 MB | ########## | 100%  2025-05-07T19:46:11.6303933Z 2025-05-07T19:46:11.6303940Z 2025-05-07T19:46:11.6303945Z 2025-05-07T19:46:11.6303978Z 2025-05-07T19:46:11.6303983Z 2025-05-07T19:46:11.6303987Z 2025-05-07T19:46:11.8677248Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:11.8677606Z 2025-05-07T19:46:11.8677613Z 2025-05-07T19:46:11.8677617Z 2025-05-07T19:46:11.8677622Z 2025-05-07T19:46:11.8677627Z 2025-05-07T19:46:11.8677632Z 2025-05-07T19:46:11.8677654Z 2025-05-07T19:46:11.9810682Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:11.9811095Z 2025-05-07T19:46:11.9811120Z 2025-05-07T19:46:11.9811124Z 2025-05-07T19:46:11.9811127Z 2025-05-07T19:46:11.9811131Z 2025-05-07T19:46:11.9811135Z 2025-05-07T19:46:11.9811140Z 2025-05-07T19:46:11.9811144Z 2025-05-07T19:46:11.9811149Z 2025-05-07T19:46:12.1973506Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:12.1973880Z 2025-05-07T19:46:12.1973923Z 2025-05-07T19:46:12.1973928Z 2025-05-07T19:46:12.1973932Z 2025-05-07T19:46:12.1973936Z 2025-05-07T19:46:12.1973979Z 2025-05-07T19:46:12.1973986Z 2025-05-07T19:46:12.1973991Z 2025-05-07T19:46:12.1973996Z 2025-05-07T19:46:12.1974001Z 2025-05-07T19:46:12.1974006Z 2025-05-07T19:46:12.1974010Z 2025-05-07T19:46:12.4109980Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:12.4110396Z 2025-05-07T19:46:12.4110403Z 2025-05-07T19:46:12.4110409Z 2025-05-07T19:46:12.4110485Z 2025-05-07T19:46:12.4110490Z 2025-05-07T19:46:12.4110494Z 2025-05-07T19:46:12.4110499Z 2025-05-07T19:46:12.4110504Z 2025-05-07T19:46:12.4110508Z 2025-05-07T19:46:12.4110513Z 2025-05-07T19:46:12.4110517Z 2025-05-07T19:46:12.4276805Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:12.4277172Z 2025-05-07T19:46:12.4277177Z 2025-05-07T19:46:12.4277181Z 2025-05-07T19:46:12.4277185Z 2025-05-07T19:46:12.4277188Z 2025-05-07T19:46:12.4277193Z 2025-05-07T19:46:12.4277197Z 2025-05-07T19:46:12.4277201Z 2025-05-07T19:46:12.4277231Z 
2025-05-07T19:46:12.4277235Z 2025-05-07T19:46:12.4277238Z 2025-05-07T19:46:12.4277242Z 2025-05-07T19:46:12.4277245Z 2025-05-07T19:46:12.6718381Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:12.6718778Z 2025-05-07T19:46:12.6718786Z 2025-05-07T19:46:12.6718792Z 2025-05-07T19:46:12.6718797Z 2025-05-07T19:46:12.6718804Z 2025-05-07T19:46:12.6718873Z 2025-05-07T19:46:12.6718878Z 2025-05-07T19:46:12.6718883Z 2025-05-07T19:46:12.6718889Z 2025-05-07T19:46:12.6718893Z 2025-05-07T19:46:12.6896579Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:12.6896928Z 2025-05-07T19:46:12.6896933Z 2025-05-07T19:46:12.6896961Z 2025-05-07T19:46:12.6896965Z 2025-05-07T19:46:12.6896970Z 2025-05-07T19:46:12.6896975Z 2025-05-07T19:46:12.6896979Z 2025-05-07T19:46:12.6896984Z 2025-05-07T19:46:12.6896987Z 2025-05-07T19:46:12.6896992Z 2025-05-07T19:46:12.6896996Z 2025-05-07T19:46:12.6897243Z 2025-05-07T19:46:12.6897249Z 2025-05-07T19:46:12.6897252Z 2025-05-07T19:46:12.7646089Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:12.8519325Z nsight-compute-2024. 
| 443.1 MB | ########## | 100% 2025-05-07T19:46:12.8519657Z 2025-05-07T19:46:12.8519663Z 2025-05-07T19:46:12.8519669Z 2025-05-07T19:46:12.8519672Z 2025-05-07T19:46:12.8519695Z 2025-05-07T19:46:12.8519972Z 2025-05-07T19:46:12.8519976Z 2025-05-07T19:46:12.8519981Z 2025-05-07T19:46:12.8519985Z 2025-05-07T19:46:12.8519989Z 2025-05-07T19:46:12.8519993Z 2025-05-07T19:46:12.8519997Z 2025-05-07T19:46:12.8520005Z 2025-05-07T19:46:12.8520010Z 2025-05-07T19:46:12.8520014Z 2025-05-07T19:46:12.8520018Z 2025-05-07T19:46:12.8652417Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:12.8652822Z 2025-05-07T19:46:12.8652828Z 2025-05-07T19:46:12.8652834Z 2025-05-07T19:46:12.8652839Z 2025-05-07T19:46:12.8652867Z 2025-05-07T19:46:12.8652872Z 2025-05-07T19:46:12.8652876Z 2025-05-07T19:46:12.8652880Z 2025-05-07T19:46:12.8652884Z 2025-05-07T19:46:12.8652889Z 2025-05-07T19:46:12.8652915Z 2025-05-07T19:46:12.8652919Z 2025-05-07T19:46:12.8652923Z 2025-05-07T19:46:12.8652927Z 2025-05-07T19:46:12.8652933Z 2025-05-07T19:46:12.9138523Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:12.9138985Z 2025-05-07T19:46:12.9138991Z 2025-05-07T19:46:12.9138995Z 2025-05-07T19:46:12.9138998Z 2025-05-07T19:46:12.9139002Z 2025-05-07T19:46:12.9139005Z 2025-05-07T19:46:12.9139009Z 2025-05-07T19:46:12.9139013Z 2025-05-07T19:46:12.9139016Z 2025-05-07T19:46:12.9139020Z 2025-05-07T19:46:12.9139023Z 2025-05-07T19:46:12.9139026Z 2025-05-07T19:46:12.9139030Z 2025-05-07T19:46:12.9139033Z 2025-05-07T19:46:12.9139037Z 2025-05-07T19:46:12.9139040Z 2025-05-07T19:46:12.9139044Z 2025-05-07T19:46:12.9139047Z 2025-05-07T19:46:12.9139051Z 2025-05-07T19:46:12.9139357Z ... (more hidden) ... 
2025-05-07T19:46:12.9139652Z 2025-05-07T19:46:12.9139656Z 2025-05-07T19:46:12.9139659Z 2025-05-07T19:46:12.9139663Z 2025-05-07T19:46:12.9139666Z 2025-05-07T19:46:12.9139670Z 2025-05-07T19:46:12.9139673Z 2025-05-07T19:46:12.9139676Z 2025-05-07T19:46:12.9139680Z 2025-05-07T19:46:12.9139683Z 2025-05-07T19:46:12.9139686Z 2025-05-07T19:46:12.9139694Z 2025-05-07T19:46:12.9139697Z 2025-05-07T19:46:12.9139701Z 2025-05-07T19:46:12.9139704Z 2025-05-07T19:46:12.9139727Z 2025-05-07T19:46:12.9139731Z 2025-05-07T19:46:12.9139734Z 2025-05-07T19:46:12.9139737Z 2025-05-07T19:46:12.9757120Z ... (more hidden) ... 2025-05-07T19:46:12.9757507Z 2025-05-07T19:46:12.9757542Z 2025-05-07T19:46:12.9757547Z 2025-05-07T19:46:12.9757551Z 2025-05-07T19:46:12.9757555Z 2025-05-07T19:46:12.9757559Z 2025-05-07T19:46:12.9757564Z 2025-05-07T19:46:12.9757568Z 2025-05-07T19:46:12.9757600Z 2025-05-07T19:46:12.9757604Z 2025-05-07T19:46:12.9757608Z 2025-05-07T19:46:12.9757612Z 2025-05-07T19:46:12.9757616Z 2025-05-07T19:46:12.9757620Z 2025-05-07T19:46:12.9757624Z 2025-05-07T19:46:12.9757628Z 2025-05-07T19:46:12.9757632Z 2025-05-07T19:46:12.9757636Z 2025-05-07T19:46:12.9758015Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:12.9758413Z 2025-05-07T19:46:12.9758417Z 2025-05-07T19:46:12.9758420Z 2025-05-07T19:46:12.9758424Z 2025-05-07T19:46:12.9758427Z 2025-05-07T19:46:12.9758431Z 2025-05-07T19:46:12.9758434Z 2025-05-07T19:46:12.9758438Z 2025-05-07T19:46:12.9758441Z 2025-05-07T19:46:12.9758444Z 2025-05-07T19:46:12.9758448Z 2025-05-07T19:46:12.9758451Z 2025-05-07T19:46:12.9758455Z 2025-05-07T19:46:12.9758458Z 2025-05-07T19:46:12.9758462Z 2025-05-07T19:46:12.9758465Z 2025-05-07T19:46:12.9758468Z 2025-05-07T19:46:12.9758497Z 2025-05-07T19:46:13.0966455Z cuda-cupti-dev-12.6. 
| 3.4 MB | ########## | 100%  2025-05-07T19:46:13.0966911Z 2025-05-07T19:46:13.0966917Z 2025-05-07T19:46:13.0966922Z 2025-05-07T19:46:13.0966927Z 2025-05-07T19:46:13.0966932Z 2025-05-07T19:46:13.0966937Z 2025-05-07T19:46:13.0966941Z 2025-05-07T19:46:13.0966946Z 2025-05-07T19:46:13.0966951Z 2025-05-07T19:46:13.0966955Z 2025-05-07T19:46:13.0966960Z 2025-05-07T19:46:13.0967106Z 2025-05-07T19:46:13.0967111Z 2025-05-07T19:46:13.0967130Z 2025-05-07T19:46:13.0967133Z 2025-05-07T19:46:13.0967137Z 2025-05-07T19:46:13.0967141Z 2025-05-07T19:46:13.8766480Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:13.8766893Z 2025-05-07T19:46:17.3629241Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:17.3633098Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:17.3633897Z 2025-05-07T19:46:17.3633940Z 2025-05-07T19:46:17.3633956Z 2025-05-07T19:46:17.3634192Z 2025-05-07T19:46:17.3634214Z 2025-05-07T19:46:17.3634225Z 2025-05-07T19:46:17.3634238Z 2025-05-07T19:46:17.3634249Z 2025-05-07T19:46:17.3634261Z 2025-05-07T19:46:17.3634272Z 2025-05-07T19:46:17.3634283Z 2025-05-07T19:46:17.3634294Z 2025-05-07T19:46:17.3634305Z 2025-05-07T19:46:17.3634317Z 2025-05-07T19:46:17.3634327Z 2025-05-07T19:46:17.3634339Z 2025-05-07T19:46:17.3634350Z 2025-05-07T19:46:17.3634399Z 2025-05-07T19:46:17.3634410Z 2025-05-07T19:46:17.3634698Z 2025-05-07T19:46:17.3635598Z  2025-05-07T19:46:17.3636425Z 2025-05-07T19:46:17.3636736Z 2025-05-07T19:46:17.3637079Z  2025-05-07T19:46:17.3637311Z 2025-05-07T19:46:17.3637382Z 2025-05-07T19:46:17.3637563Z  2025-05-07T19:46:17.3637789Z 2025-05-07T19:46:17.3637794Z 2025-05-07T19:46:17.3637833Z 2025-05-07T19:46:17.3638037Z  2025-05-07T19:46:17.3638264Z 2025-05-07T19:46:17.3638268Z 2025-05-07T19:46:17.3638272Z 2025-05-07T19:46:17.3638276Z 2025-05-07T19:46:17.3638462Z  2025-05-07T19:46:17.3638712Z 2025-05-07T19:46:17.3638717Z 2025-05-07T19:46:17.3638721Z 2025-05-07T19:46:17.3638740Z 2025-05-07T19:46:17.3638744Z 
2025-05-07T19:46:17.3677743Z  2025-05-07T19:46:17.3677978Z 2025-05-07T19:46:17.3678010Z 2025-05-07T19:46:17.3678013Z 2025-05-07T19:46:17.3678017Z 2025-05-07T19:46:17.3678020Z 2025-05-07T19:46:17.3678024Z 2025-05-07T19:46:17.3678027Z 2025-05-07T19:46:17.3678031Z 2025-05-07T19:46:17.3678034Z 2025-05-07T19:46:17.3678038Z 2025-05-07T19:46:17.3678045Z 2025-05-07T19:46:17.3678049Z 2025-05-07T19:46:17.3678052Z 2025-05-07T19:46:17.3678055Z 2025-05-07T19:46:17.3678059Z 2025-05-07T19:46:17.3678062Z 2025-05-07T19:46:17.3678066Z 2025-05-07T19:46:17.3678069Z 2025-05-07T19:46:17.3678253Z  2025-05-07T19:46:17.3678523Z 2025-05-07T19:46:17.3678527Z 2025-05-07T19:46:17.3678641Z  2025-05-07T19:46:17.3678763Z 2025-05-07T19:46:17.3678767Z 2025-05-07T19:46:17.3678902Z  2025-05-07T19:46:17.3679020Z 2025-05-07T19:46:17.3679024Z 2025-05-07T19:46:17.3679027Z 2025-05-07T19:46:17.3679149Z  2025-05-07T19:46:17.3679307Z 2025-05-07T19:46:17.3679310Z 2025-05-07T19:46:17.3679315Z 2025-05-07T19:46:17.3679319Z 2025-05-07T19:46:17.3679438Z  2025-05-07T19:46:17.3679573Z 2025-05-07T19:46:17.3679576Z 2025-05-07T19:46:17.3679580Z 2025-05-07T19:46:17.3679583Z 2025-05-07T19:46:17.3679612Z 2025-05-07T19:46:17.3679734Z  2025-05-07T19:46:17.3679872Z 2025-05-07T19:46:17.3679881Z 2025-05-07T19:46:17.3679884Z 2025-05-07T19:46:17.3679888Z 2025-05-07T19:46:17.3679891Z 2025-05-07T19:46:17.3679895Z 2025-05-07T19:46:17.3680055Z  2025-05-07T19:46:17.3680201Z 2025-05-07T19:46:17.3680205Z 2025-05-07T19:46:17.3680208Z 2025-05-07T19:46:17.3680212Z 2025-05-07T19:46:17.3680215Z 2025-05-07T19:46:17.3680219Z 2025-05-07T19:46:17.3680222Z 2025-05-07T19:46:17.3680349Z  2025-05-07T19:46:17.3680533Z 2025-05-07T19:46:17.3680536Z 2025-05-07T19:46:17.3680540Z 2025-05-07T19:46:17.3680543Z 2025-05-07T19:46:17.3680547Z 2025-05-07T19:46:17.3680612Z 2025-05-07T19:46:17.3680616Z 2025-05-07T19:46:17.3680619Z 2025-05-07T19:46:17.3680754Z  2025-05-07T19:46:17.3680945Z 2025-05-07T19:46:17.3680949Z 2025-05-07T19:46:17.3680953Z 
2025-05-07T19:46:17.3680956Z 2025-05-07T19:46:17.3680960Z 2025-05-07T19:46:17.3680963Z 2025-05-07T19:46:17.3680967Z 2025-05-07T19:46:17.3680970Z 2025-05-07T19:46:17.3680974Z 2025-05-07T19:46:17.3681173Z  2025-05-07T19:46:17.3681360Z 2025-05-07T19:46:17.3681363Z 2025-05-07T19:46:17.3681367Z 2025-05-07T19:46:17.3681371Z 2025-05-07T19:46:17.3681374Z 2025-05-07T19:46:17.3681378Z 2025-05-07T19:46:17.3681381Z 2025-05-07T19:46:17.3681385Z 2025-05-07T19:46:17.3681388Z 2025-05-07T19:46:17.3681391Z 2025-05-07T19:46:17.3681536Z  2025-05-07T19:46:17.3681713Z 2025-05-07T19:46:17.3681719Z 2025-05-07T19:46:17.3681722Z 2025-05-07T19:46:17.3681726Z 2025-05-07T19:46:17.3681752Z 2025-05-07T19:46:17.3681755Z 2025-05-07T19:46:17.3681763Z 2025-05-07T19:46:17.3681767Z 2025-05-07T19:46:17.3681770Z 2025-05-07T19:46:17.3681773Z 2025-05-07T19:46:17.3681777Z 2025-05-07T19:46:17.3681925Z  2025-05-07T19:46:17.3682117Z 2025-05-07T19:46:17.3682121Z 2025-05-07T19:46:17.3682124Z 2025-05-07T19:46:17.3682128Z 2025-05-07T19:46:17.3682158Z 2025-05-07T19:46:17.3682162Z 2025-05-07T19:46:17.3682165Z 2025-05-07T19:46:17.3682173Z 2025-05-07T19:46:17.3682176Z 2025-05-07T19:46:17.3682180Z 2025-05-07T19:46:17.3682183Z 2025-05-07T19:46:17.3682186Z 2025-05-07T19:46:17.3682335Z  2025-05-07T19:46:17.3682538Z 2025-05-07T19:46:17.3682542Z 2025-05-07T19:46:17.3682545Z 2025-05-07T19:46:17.3682576Z 2025-05-07T19:46:17.3682579Z 2025-05-07T19:46:17.3682582Z 2025-05-07T19:46:17.3682586Z 2025-05-07T19:46:17.3682589Z 2025-05-07T19:46:17.3682592Z 2025-05-07T19:46:17.3682596Z 2025-05-07T19:46:17.3682599Z 2025-05-07T19:46:17.3682603Z 2025-05-07T19:46:17.3682606Z 2025-05-07T19:46:17.3682762Z  2025-05-07T19:46:17.3682972Z 2025-05-07T19:46:17.3683001Z 2025-05-07T19:46:17.3683004Z 2025-05-07T19:46:17.3683008Z 2025-05-07T19:46:17.3683011Z 2025-05-07T19:46:17.3683014Z 2025-05-07T19:46:17.3683018Z 2025-05-07T19:46:17.3683022Z 2025-05-07T19:46:17.3683025Z 2025-05-07T19:46:17.3683029Z 2025-05-07T19:46:17.3683032Z 
2025-05-07T19:46:17.3683036Z 2025-05-07T19:46:17.3683043Z 2025-05-07T19:46:17.3683046Z 2025-05-07T19:46:17.3683200Z  2025-05-07T19:46:17.3683440Z 2025-05-07T19:46:17.3683443Z 2025-05-07T19:46:17.3683447Z 2025-05-07T19:46:17.3683450Z 2025-05-07T19:46:17.3683453Z 2025-05-07T19:46:17.3683457Z 2025-05-07T19:46:17.3683460Z 2025-05-07T19:46:17.3683464Z 2025-05-07T19:46:17.3683467Z 2025-05-07T19:46:17.3683471Z 2025-05-07T19:46:17.3683474Z 2025-05-07T19:46:17.3683478Z 2025-05-07T19:46:17.3683481Z 2025-05-07T19:46:17.3683485Z 2025-05-07T19:46:17.3683488Z 2025-05-07T19:46:17.3683679Z  2025-05-07T19:46:17.3683898Z 2025-05-07T19:46:17.3683902Z 2025-05-07T19:46:17.3683905Z 2025-05-07T19:46:17.3683909Z 2025-05-07T19:46:17.3683912Z 2025-05-07T19:46:17.3683916Z 2025-05-07T19:46:17.3683919Z 2025-05-07T19:46:17.3683922Z 2025-05-07T19:46:17.3683926Z 2025-05-07T19:46:17.3683929Z 2025-05-07T19:46:17.3683933Z 2025-05-07T19:46:17.3683936Z 2025-05-07T19:46:17.3683943Z 2025-05-07T19:46:17.3683946Z 2025-05-07T19:46:17.3683950Z 2025-05-07T19:46:17.3683953Z 2025-05-07T19:46:17.3684144Z  2025-05-07T19:46:17.3684370Z 2025-05-07T19:46:17.3684374Z 2025-05-07T19:46:17.3684377Z 2025-05-07T19:46:17.3684381Z 2025-05-07T19:46:17.3684384Z 2025-05-07T19:46:17.3684387Z 2025-05-07T19:46:17.3684391Z 2025-05-07T19:46:17.3684394Z 2025-05-07T19:46:17.3684398Z 2025-05-07T19:46:17.3684401Z 2025-05-07T19:46:17.3684405Z 2025-05-07T19:46:17.3684432Z 2025-05-07T19:46:17.3684435Z 2025-05-07T19:46:17.3684496Z 2025-05-07T19:46:17.3684499Z 2025-05-07T19:46:17.3684503Z 2025-05-07T19:46:17.3684506Z 2025-05-07T19:46:17.3684683Z  2025-05-07T19:46:17.3684917Z 2025-05-07T19:46:17.3684920Z 2025-05-07T19:46:17.3684924Z 2025-05-07T19:46:17.3684928Z 2025-05-07T19:46:17.3684931Z 2025-05-07T19:46:17.3684960Z 2025-05-07T19:46:17.3684964Z 2025-05-07T19:46:17.3684967Z 2025-05-07T19:46:17.3685027Z 2025-05-07T19:46:17.3685031Z 2025-05-07T19:46:17.3685034Z 2025-05-07T19:46:17.3685038Z 2025-05-07T19:46:17.3685041Z 
2025-05-07T19:46:17.3685044Z 2025-05-07T19:46:17.3685048Z 2025-05-07T19:46:17.3685051Z 2025-05-07T19:46:17.3685055Z 2025-05-07T19:46:17.3685058Z 2025-05-07T19:46:17.3685237Z  2025-05-07T19:46:17.3685503Z 2025-05-07T19:46:17.3685507Z 2025-05-07T19:46:17.3685619Z  2025-05-07T19:46:17.3685739Z 2025-05-07T19:46:17.3685743Z 2025-05-07T19:46:17.3685882Z  2025-05-07T19:46:17.3686003Z 2025-05-07T19:46:17.3686011Z 2025-05-07T19:46:17.3686015Z 2025-05-07T19:46:17.3686133Z  2025-05-07T19:46:17.3686287Z 2025-05-07T19:46:17.3686291Z 2025-05-07T19:46:17.3686294Z 2025-05-07T19:46:17.3686298Z 2025-05-07T19:46:17.3686417Z  2025-05-07T19:46:17.3686553Z 2025-05-07T19:46:17.3686556Z 2025-05-07T19:46:17.3686560Z 2025-05-07T19:46:17.3686563Z 2025-05-07T19:46:17.3686568Z 2025-05-07T19:46:17.3686724Z  2025-05-07T19:46:17.3686866Z 2025-05-07T19:46:17.3686870Z 2025-05-07T19:46:17.3686873Z 2025-05-07T19:46:17.3686877Z 2025-05-07T19:46:17.3686880Z 2025-05-07T19:46:17.3686883Z 2025-05-07T19:46:17.3687009Z  2025-05-07T19:46:17.3687180Z 2025-05-07T19:46:17.3687184Z 2025-05-07T19:46:17.3687188Z 2025-05-07T19:46:17.3687192Z 2025-05-07T19:46:17.3687195Z 2025-05-07T19:46:17.3687199Z 2025-05-07T19:46:17.3687202Z 2025-05-07T19:46:17.3687331Z  2025-05-07T19:46:17.3687517Z 2025-05-07T19:46:17.3687521Z 2025-05-07T19:46:17.3687529Z 2025-05-07T19:46:17.3687532Z 2025-05-07T19:46:17.3687536Z 2025-05-07T19:46:17.3687539Z 2025-05-07T19:46:17.3687543Z 2025-05-07T19:46:17.3687547Z 2025-05-07T19:46:17.3687682Z  2025-05-07T19:46:17.3687849Z 2025-05-07T19:46:17.3687853Z 2025-05-07T19:46:17.3687882Z 2025-05-07T19:46:17.3687885Z 2025-05-07T19:46:17.3687889Z 2025-05-07T19:46:17.3687892Z 2025-05-07T19:46:17.3687895Z 2025-05-07T19:46:17.3687903Z 2025-05-07T19:46:17.3687906Z 2025-05-07T19:46:17.3688042Z  2025-05-07T19:46:17.3688216Z 2025-05-07T19:46:17.3688220Z 2025-05-07T19:46:17.3688224Z 2025-05-07T19:46:17.3688227Z 2025-05-07T19:46:17.3688256Z 2025-05-07T19:46:17.3688259Z 2025-05-07T19:46:17.3688262Z 
2025-05-07T19:46:17.3688266Z 2025-05-07T19:46:17.3688270Z 2025-05-07T19:46:17.3688273Z 2025-05-07T19:46:17.3688415Z  2025-05-07T19:46:17.3688596Z 2025-05-07T19:46:17.3688599Z 2025-05-07T19:46:17.3688603Z 2025-05-07T19:46:17.3688606Z 2025-05-07T19:46:17.3688639Z 2025-05-07T19:46:17.3688643Z 2025-05-07T19:46:17.3688646Z 2025-05-07T19:46:17.3688650Z 2025-05-07T19:46:17.3688653Z 2025-05-07T19:46:17.3688657Z 2025-05-07T19:46:17.3688661Z 2025-05-07T19:46:17.3688805Z  2025-05-07T19:46:17.3688994Z 2025-05-07T19:46:17.3688997Z 2025-05-07T19:46:17.3689001Z 2025-05-07T19:46:17.3689005Z 2025-05-07T19:46:17.3689032Z 2025-05-07T19:46:17.3689040Z 2025-05-07T19:46:17.3689043Z 2025-05-07T19:46:17.3689046Z 2025-05-07T19:46:17.3689050Z 2025-05-07T19:46:17.3689053Z 2025-05-07T19:46:17.3689058Z 2025-05-07T19:46:17.3689061Z 2025-05-07T19:46:17.3689211Z  2025-05-07T19:46:17.3689410Z 2025-05-07T19:46:17.3689413Z 2025-05-07T19:46:17.3689417Z 2025-05-07T19:46:17.3689448Z 2025-05-07T19:46:17.3689452Z 2025-05-07T19:46:17.3689455Z 2025-05-07T19:46:17.3689459Z 2025-05-07T19:46:17.3689462Z 2025-05-07T19:46:17.3689466Z 2025-05-07T19:46:17.3689469Z 2025-05-07T19:46:17.3689473Z 2025-05-07T19:46:17.3689541Z 2025-05-07T19:46:17.3689546Z 2025-05-07T19:46:17.3689697Z  2025-05-07T19:46:17.3689907Z 2025-05-07T19:46:17.3689938Z 2025-05-07T19:46:17.3689941Z 2025-05-07T19:46:17.3689945Z 2025-05-07T19:46:17.3689948Z 2025-05-07T19:46:17.3689951Z 2025-05-07T19:46:17.3689955Z 2025-05-07T19:46:17.3689959Z 2025-05-07T19:46:17.3689962Z 2025-05-07T19:46:17.3689966Z 2025-05-07T19:46:17.3690030Z 2025-05-07T19:46:17.3690033Z 2025-05-07T19:46:17.3690036Z 2025-05-07T19:46:17.3690040Z 2025-05-07T19:46:17.3690195Z  2025-05-07T19:46:17.3690435Z 2025-05-07T19:46:17.3690439Z 2025-05-07T19:46:17.3690444Z 2025-05-07T19:46:17.3690448Z 2025-05-07T19:46:17.3690451Z 2025-05-07T19:46:17.3690455Z 2025-05-07T19:46:17.3690458Z 2025-05-07T19:46:17.3690462Z 2025-05-07T19:46:17.3690465Z 2025-05-07T19:46:17.3690468Z 
2025-05-07T19:46:17.3690472Z 2025-05-07T19:46:17.3690475Z 2025-05-07T19:46:17.3690479Z 2025-05-07T19:46:17.3690486Z 2025-05-07T19:46:17.3690489Z 2025-05-07T19:46:17.3690676Z  2025-05-07T19:46:17.3690893Z 2025-05-07T19:46:17.3690897Z 2025-05-07T19:46:17.3690901Z 2025-05-07T19:46:17.3690904Z 2025-05-07T19:46:17.3690907Z 2025-05-07T19:46:17.3690911Z 2025-05-07T19:46:17.3690914Z 2025-05-07T19:46:17.3690918Z 2025-05-07T19:46:17.3690921Z 2025-05-07T19:46:17.3690925Z 2025-05-07T19:46:17.3690932Z 2025-05-07T19:46:17.3690936Z 2025-05-07T19:46:17.3690940Z 2025-05-07T19:46:17.3690943Z 2025-05-07T19:46:17.3690947Z 2025-05-07T19:46:17.3690950Z 2025-05-07T19:46:17.3691142Z  2025-05-07T19:46:17.3691367Z 2025-05-07T19:46:17.3691370Z 2025-05-07T19:46:17.3691374Z 2025-05-07T19:46:17.3691377Z 2025-05-07T19:46:17.3691381Z 2025-05-07T19:46:17.3691384Z 2025-05-07T19:46:17.3691388Z 2025-05-07T19:46:17.3691392Z 2025-05-07T19:46:17.3691395Z 2025-05-07T19:46:17.3691398Z 2025-05-07T19:46:17.3691402Z 2025-05-07T19:46:17.3691434Z 2025-05-07T19:46:17.3691437Z 2025-05-07T19:46:17.3691440Z 2025-05-07T19:46:17.3691444Z 2025-05-07T19:46:17.3691447Z 2025-05-07T19:46:17.3691451Z 2025-05-07T19:46:17.3691622Z  2025-05-07T19:46:17.3691852Z 2025-05-07T19:46:17.3691856Z 2025-05-07T19:46:17.3691859Z 2025-05-07T19:46:17.3691863Z 2025-05-07T19:46:17.3691866Z 2025-05-07T19:46:17.3691897Z 2025-05-07T19:46:17.3691905Z 2025-05-07T19:46:17.3691908Z 2025-05-07T19:46:17.3691912Z 2025-05-07T19:46:17.3691915Z 2025-05-07T19:46:17.3691919Z 2025-05-07T19:46:17.3691922Z 2025-05-07T19:46:17.3691926Z 2025-05-07T19:46:17.3691929Z 2025-05-07T19:46:17.3691933Z 2025-05-07T19:46:17.3691936Z 2025-05-07T19:46:17.3691940Z 2025-05-07T19:46:17.3691943Z 2025-05-07T19:46:17.3692122Z  2025-05-07T19:46:17.3692382Z 2025-05-07T19:46:17.3692386Z 2025-05-07T19:46:17.3692496Z  2025-05-07T19:46:17.3692619Z 2025-05-07T19:46:17.3692623Z 2025-05-07T19:46:17.3692764Z  2025-05-07T19:46:17.3692888Z 2025-05-07T19:46:17.3692891Z 
2025-05-07T19:46:17.3692895Z 2025-05-07T19:46:17.3693013Z  2025-05-07T19:46:17.3693166Z 2025-05-07T19:46:17.3693170Z 2025-05-07T19:46:17.3693175Z 2025-05-07T19:46:17.3693179Z 2025-05-07T19:46:17.3693298Z  2025-05-07T19:46:17.3693431Z 2025-05-07T19:46:17.3693435Z 2025-05-07T19:46:17.3693439Z 2025-05-07T19:46:17.3693446Z 2025-05-07T19:46:17.3693449Z 2025-05-07T19:46:17.3693600Z  2025-05-07T19:46:17.3693737Z 2025-05-07T19:46:17.3693740Z 2025-05-07T19:46:17.3693744Z 2025-05-07T19:46:17.3693747Z 2025-05-07T19:46:17.3693751Z 2025-05-07T19:46:17.3693755Z 2025-05-07T19:46:17.3693878Z  2025-05-07T19:46:17.3694048Z 2025-05-07T19:46:17.3694052Z 2025-05-07T19:46:17.3694056Z 2025-05-07T19:46:17.3694059Z 2025-05-07T19:46:17.3694063Z 2025-05-07T19:46:17.3694066Z 2025-05-07T19:46:17.3694070Z 2025-05-07T19:46:17.3694201Z  2025-05-07T19:46:17.3694442Z 2025-05-07T19:46:17.3694446Z 2025-05-07T19:46:17.3694450Z 2025-05-07T19:46:17.3694453Z 2025-05-07T19:46:17.3694457Z 2025-05-07T19:46:17.3694460Z 2025-05-07T19:46:17.3694464Z 2025-05-07T19:46:17.3694467Z 2025-05-07T19:46:17.3694602Z  2025-05-07T19:46:17.3694771Z 2025-05-07T19:46:17.3694775Z 2025-05-07T19:46:17.3694807Z 2025-05-07T19:46:17.3694810Z 2025-05-07T19:46:17.3694814Z 2025-05-07T19:46:17.3694895Z 2025-05-07T19:46:17.3694899Z 2025-05-07T19:46:17.3694902Z 2025-05-07T19:46:17.3694906Z 2025-05-07T19:46:17.3695044Z  2025-05-07T19:46:17.3695218Z 2025-05-07T19:46:17.3695222Z 2025-05-07T19:46:17.3695225Z 2025-05-07T19:46:17.3695253Z 2025-05-07T19:46:17.3695256Z 2025-05-07T19:46:17.3695260Z 2025-05-07T19:46:17.3695264Z 2025-05-07T19:46:17.3695324Z 2025-05-07T19:46:17.3695328Z 2025-05-07T19:46:17.3695331Z 2025-05-07T19:46:17.3695501Z  2025-05-07T19:46:17.3695688Z 2025-05-07T19:46:17.3695691Z 2025-05-07T19:46:17.3695699Z 2025-05-07T19:46:17.3695703Z 2025-05-07T19:46:17.3695706Z 2025-05-07T19:46:17.3695710Z 2025-05-07T19:46:17.3695714Z 2025-05-07T19:46:17.3695717Z 2025-05-07T19:46:17.3695720Z 2025-05-07T19:46:17.3695724Z 
2025-05-07T19:46:17.3695727Z 2025-05-07T19:46:17.3695903Z  2025-05-07T19:46:17.3696098Z 2025-05-07T19:46:17.3696102Z 2025-05-07T19:46:17.3696105Z 2025-05-07T19:46:17.3696112Z 2025-05-07T19:46:17.3696116Z 2025-05-07T19:46:17.3696120Z 2025-05-07T19:46:17.3696123Z 2025-05-07T19:46:17.3696127Z 2025-05-07T19:46:17.3696130Z 2025-05-07T19:46:17.3696134Z 2025-05-07T19:46:17.3696137Z 2025-05-07T19:46:17.3696140Z 2025-05-07T19:46:17.3696321Z  2025-05-07T19:46:17.3696523Z 2025-05-07T19:46:17.3696527Z 2025-05-07T19:46:17.3696530Z 2025-05-07T19:46:17.3696534Z 2025-05-07T19:46:17.3696537Z 2025-05-07T19:46:17.3696540Z 2025-05-07T19:46:17.3696544Z 2025-05-07T19:46:17.3696547Z 2025-05-07T19:46:17.3696555Z 2025-05-07T19:46:17.3696559Z 2025-05-07T19:46:17.3696563Z 2025-05-07T19:46:17.3696566Z 2025-05-07T19:46:17.3696570Z 2025-05-07T19:46:17.3696757Z  2025-05-07T19:46:17.3696966Z 2025-05-07T19:46:17.3696970Z 2025-05-07T19:46:17.3696973Z 2025-05-07T19:46:17.3696977Z 2025-05-07T19:46:17.3696980Z 2025-05-07T19:46:17.3696984Z 2025-05-07T19:46:17.3696987Z 2025-05-07T19:46:17.3696995Z 2025-05-07T19:46:17.3696998Z 2025-05-07T19:46:17.3697001Z 2025-05-07T19:46:17.3697005Z 2025-05-07T19:46:17.3697037Z 2025-05-07T19:46:17.3697040Z 2025-05-07T19:46:17.3697044Z 2025-05-07T19:46:17.3697203Z  2025-05-07T19:46:17.3697420Z 2025-05-07T19:46:17.3697423Z 2025-05-07T19:46:17.3697427Z 2025-05-07T19:46:17.3697430Z 2025-05-07T19:46:17.3697434Z 2025-05-07T19:46:17.3697437Z 2025-05-07T19:46:17.3697441Z 2025-05-07T19:46:17.3697445Z 2025-05-07T19:46:17.3697479Z 2025-05-07T19:46:17.3697482Z 2025-05-07T19:46:17.3697485Z 2025-05-07T19:46:17.3697493Z 2025-05-07T19:46:17.3697496Z 2025-05-07T19:46:17.3697499Z 2025-05-07T19:46:17.3697503Z 2025-05-07T19:46:17.3697667Z  2025-05-07T19:46:17.3697885Z 2025-05-07T19:46:17.3697888Z 2025-05-07T19:46:17.3697892Z 2025-05-07T19:46:17.3697895Z 2025-05-07T19:46:17.3697924Z 2025-05-07T19:46:17.3697928Z 2025-05-07T19:46:17.3697931Z 2025-05-07T19:46:17.3697934Z 
2025-05-07T19:46:17.3697942Z 2025-05-07T19:46:17.3697945Z 2025-05-07T19:46:17.3697948Z 2025-05-07T19:46:17.3697952Z 2025-05-07T19:46:17.3697955Z 2025-05-07T19:46:17.3697959Z 2025-05-07T19:46:17.3697962Z 2025-05-07T19:46:17.3697965Z 2025-05-07T19:46:17.3698131Z  2025-05-07T19:46:17.3698383Z 2025-05-07T19:46:17.3698387Z 2025-05-07T19:46:17.3698390Z 2025-05-07T19:46:17.3698394Z 2025-05-07T19:46:17.3698397Z 2025-05-07T19:46:17.3698400Z 2025-05-07T19:46:17.3698404Z 2025-05-07T19:46:17.3698407Z 2025-05-07T19:46:17.3698410Z 2025-05-07T19:46:17.3698491Z 2025-05-07T19:46:17.3698495Z 2025-05-07T19:46:17.3698498Z 2025-05-07T19:46:17.3698502Z 2025-05-07T19:46:17.3698505Z 2025-05-07T19:46:17.3698508Z 2025-05-07T19:46:17.3698512Z 2025-05-07T19:46:17.3698515Z 2025-05-07T19:46:17.3698715Z  2025-05-07T19:46:17.3698949Z 2025-05-07T19:46:17.3698952Z 2025-05-07T19:46:17.3698956Z 2025-05-07T19:46:17.3698960Z 2025-05-07T19:46:17.3699021Z 2025-05-07T19:46:17.3699024Z 2025-05-07T19:46:17.3699028Z 2025-05-07T19:46:17.3699031Z 2025-05-07T19:46:17.3699035Z 2025-05-07T19:46:17.3699038Z 2025-05-07T19:46:17.3699042Z 2025-05-07T19:46:17.3699045Z 2025-05-07T19:46:17.3699048Z 2025-05-07T19:46:17.3699051Z 2025-05-07T19:46:17.3699055Z 2025-05-07T19:46:17.3699058Z 2025-05-07T19:46:17.3699087Z 2025-05-07T19:46:17.3699091Z 2025-05-07T19:46:17.3699269Z  2025-05-07T19:46:17.3699508Z 2025-05-07T19:46:17.3699512Z 2025-05-07T19:46:17.3699659Z  2025-05-07T19:46:17.3699783Z 2025-05-07T19:46:17.3699787Z 2025-05-07T19:46:17.3699906Z  2025-05-07T19:46:17.3700030Z 2025-05-07T19:46:17.3700034Z 2025-05-07T19:46:17.3700038Z 2025-05-07T19:46:17.3700186Z  2025-05-07T19:46:17.3700309Z 2025-05-07T19:46:17.3700313Z 2025-05-07T19:46:17.3700316Z 2025-05-07T19:46:17.3700320Z 2025-05-07T19:46:17.3700447Z  2025-05-07T19:46:17.3700605Z 2025-05-07T19:46:17.3700613Z 2025-05-07T19:46:17.3700616Z 2025-05-07T19:46:17.3700620Z 2025-05-07T19:46:17.3700624Z 2025-05-07T19:46:17.3700747Z  2025-05-07T19:46:17.3700916Z 
2025-05-07T19:46:17.3701398Z done 2025-05-07T19:46:17.5783651Z Preparing transaction: done 2025-05-07T19:46:18.2799913Z Verifying transaction: done 2025-05-07T19:46:18.5854184Z Executing transaction: done 2025-05-07T19:46:20.3378272Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:20.3380379Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:20.3382625Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:20.3384458Z 2025-05-07T19:46:20.3395502Z 2025-05-07T19:46:20.3396871Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:20.3397697Z 2025-05-07T19:46:20.3410621Z 2025-05-07T19:46:20.3411347Z [INSTALL] Copying nvtx3 headers ...
2025-05-07T19:46:20.3416655Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:20.3420731Z 2025-05-07T19:46:20.3633489Z 2025-05-07T19:46:20.3640106Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:20.3644293Z 
2025-05-07T19:46:20.3657767Z 2025-05-07T19:46:20.3658999Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:20.4045557Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:22.0494071Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:22.0494896Z 2025-05-07T19:46:22.4629519Z 2025-05-07T19:46:22.4633014Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:22.4997348Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:22.4997916Z 2025-05-07T19:46:22.9100569Z 2025-05-07T19:46:22.9101455Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:22.9104474Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:22.9105283Z 2025-05-07T19:46:23.3196104Z 2025-05-07T19:46:25.0378683Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:26.7590526Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:28.4815962Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:28.4816879Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:30.2356396Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:31.8362099Z 
/github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:31.8362929Z 2025-05-07T19:46:31.8941132Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:35.1433737Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:35.1435918Z Target: x86_64-conda-linux-gnu 2025-05-07T19:46:35.1436246Z Thread model: posix 2025-05-07T19:46:35.1436602Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:46:35.1437272Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:46:35.1437761Z 2025-05-07T19:46:35.2002460Z [INSTALL] Resetting compiler symlinks to clang ... 2025-05-07T19:46:38.5738051Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:46:38.5738639Z 2025-05-07T19:46:38.5753028Z 2025-05-07T19:46:38.5773107Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:46:38.5774064Z 2025-05-07T19:46:38.5785866Z 2025-05-07T19:46:38.5805289Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:38.5806896Z 2025-05-07T19:46:38.5822295Z 2025-05-07T19:46:38.5839540Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:46:38.5841061Z 2025-05-07T19:46:38.5855131Z 2025-05-07T19:46:38.5856182Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:38.5857718Z 2025-05-07T19:46:38.5871963Z total 20 2025-05-07T19:46:38.5872306Z drwxr-xr-x. 2 root root 154 May 7 19:46 . 2025-05-07T19:46:38.5872713Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:38.5873156Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:38.5873751Z -rw-r--r--. 
2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:38.5874334Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:46:38.5874801Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:46:38.5875277Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:38.5875564Z 2025-05-07T19:46:38.5875802Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:38.5876524Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:38.5888496Z 2025-05-07T19:46:38.5888514Z 2025-05-07T19:46:38.5889282Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:38.5890131Z 2025-05-07T19:46:40.2882782Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:40.2883760Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:46:40.2887092Z 2025-05-07T19:46:40.2887473Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:46:41.9557895Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:41.9559056Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:46:41.9559827Z 2025-05-07T19:46:42.3740818Z 2025-05-07T19:46:42.3741610Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:42.3741967Z 2025-05-07T19:46:43.9817156Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:43.9818932Z 2025-05-07T19:46:44.0598511Z 2025-05-07T19:46:44.0598942Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:46:44.0599533Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:44.0599915Z 2025-05-07T19:46:45.7604276Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:45.7605258Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:45.7606078Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:45.7606858Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:45.7607400Z #define ADJ_NANO 0x2000 2025-05-07T19:46:45.7607688Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:45.7607979Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:45.7608357Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:45.7608675Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:45.7608976Z #define ADJ_TAI 0x0080 2025-05-07T19:46:45.7609232Z #define ADJ_TICK 0x4000 2025-05-07T19:46:45.7609525Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:45.7609838Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:45.7610164Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:45.7610501Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:45.7610863Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:45.7611228Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:45.7611562Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:45.7611875Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:45.7612148Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:45.7612455Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:45.7613176Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:45.7613543Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:45.7613823Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:45.7614282Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:45.7614571Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:45.7614874Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:45.7615180Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:45.7615459Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:45.7616189Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:45.7616494Z #define CLOCK_PROCESS_CPUTIME_ID 2 
2025-05-07T19:46:45.7616824Z #define CLOCK_REALTIME 0 2025-05-07T19:46:45.7617105Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:45.7617436Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:45.7617734Z #define CLOCK_TAI 11 2025-05-07T19:46:45.7618027Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:45.7618368Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:45.7618651Z #define CUDARTAPI 2025-05-07T19:46:45.7618941Z #define CUDARTAPI_CDECL 2025-05-07T19:46:45.7619206Z #define CUDART_CB 2025-05-07T19:46:45.7619486Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:45.7619779Z #define CUDART_VERSION 12060 2025-05-07T19:46:45.7620097Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:45.7620418Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:45.7620755Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:45.7621061Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:45.7621387Z #define DOMAIN 1 2025-05-07T19:46:45.7621659Z #define EOF (-1) 2025-05-07T19:46:45.7621906Z #define EXIT_FAILURE 1 2025-05-07T19:46:45.7622194Z #define EXIT_SUCCESS 0 2025-05-07T19:46:45.7622494Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:45.7622933Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:45.7623326Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:45.7623751Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:45.7624108Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:45.7624460Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:45.7624807Z #define FILENAME_MAX 4096 2025-05-07T19:46:45.7625106Z #define FOPEN_MAX 16 2025-05-07T19:46:45.7625436Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:45.7625769Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:45.7626119Z #define FP_INFINITE 1 2025-05-07T19:46:45.7626384Z #define FP_NAN 0 2025-05-07T19:46:45.7626656Z #define FP_NORMAL 4 2025-05-07T19:46:45.7626916Z #define FP_SUBNORMAL 3 2025-05-07T19:46:45.7627204Z #define FP_ZERO 2 
2025-05-07T19:46:45.7627592Z #define HOST_NAME_MAX 64 2025-05-07T19:46:45.7627884Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:45.7628175Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:45.7628737Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:45.7629104Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:45.7629488Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:45.7629821Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:45.7630127Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:46:45.7630440Z #define IOV_MAX 1024 2025-05-07T19:46:45.7630707Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:45.7631044Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:45.7631358Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:45.7631705Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:45.7632034Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:45.7632332Z #define LONG_BIT 64 2025-05-07T19:46:45.7632627Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:45.7632970Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:45.7633335Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:45.7633634Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:46:45.7633958Z #define L_ctermid 9 2025-05-07T19:46:45.7634291Z #define L_cuserid 9 2025-05-07T19:46:45.7634567Z #define L_tmpnam 20 2025-05-07T19:46:45.7634822Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:45.7635121Z #define MATH_ERRNO 1 2025-05-07T19:46:45.7635375Z #define MAX_CANON 255 2025-05-07T19:46:45.7635806Z #define MAX_INPUT 255 2025-05-07T19:46:45.7636123Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:45.7636458Z #define MB_LEN_MAX 16 2025-05-07T19:46:45.7636756Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:45.7637078Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:45.7637385Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:45.7637699Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:45.7638135Z #define MOD_MAXERROR ADJ_MAXERROR 
2025-05-07T19:46:45.7638433Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:45.7638737Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:45.7639012Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:45.7639329Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:45.7639638Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:45.7639913Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:45.7640240Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:45.7640511Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:45.7640870Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:45.7641225Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:45.7641582Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:45.7641938Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:45.7642322Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:45.7642707Z #define M_E 2.7182818284590452354 2025-05-07T19:46:45.7643023Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:45.7643384Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:45.7643720Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:45.7644089Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:45.7644417Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:45.7644785Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:45.7645131Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:45.7645512Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:45.7645878Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:45.7646336Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:45.7646637Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:45.7646948Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:45.7647292Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:45.7647605Z #define M_PI_4l 0.785398163397448309615660845819875721L 
2025-05-07T19:46:45.7647983Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:45.7648490Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:45.7648889Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:45.7649273Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:45.7649608Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:45.7650144Z #define NAME_MAX 255 2025-05-07T19:46:45.7650468Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:45.7650790Z #define NFDBITS __NFDBITS 2025-05-07T19:46:45.7651064Z #define NGROUPS_MAX 65536 2025-05-07T19:46:45.7651364Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:45.7651669Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:45.7651993Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:45.7652262Z #define NL_NMAX INT_MAX 2025-05-07T19:46:45.7652545Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:45.7652836Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:45.7653101Z #define NULL __null 2025-05-07T19:46:45.7653367Z #define NZERO 20 2025-05-07T19:46:45.7653605Z #define OVERFLOW 3 2025-05-07T19:46:45.7653872Z #define PATH_MAX 4096 2025-05-07T19:46:45.7654139Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:45.7654445Z #define PIPE_BUF 4096 2025-05-07T19:46:45.7654690Z #define PLOSS 6 2025-05-07T19:46:45.7655093Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:45.7655553Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:45.7655869Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:45.7656160Z #define P_tmpdir "/tmp" 2025-05-07T19:46:45.7656534Z #define RAND_MAX 2147483647 2025-05-07T19:46:45.7656839Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:45.7657105Z #define RTSIG_MAX 32 2025-05-07T19:46:45.7657412Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:45.7657714Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:46:45.7658044Z #define SEEK_CUR 1 2025-05-07T19:46:45.7658345Z #define SEEK_DATA 3 
2025-05-07T19:46:45.7658605Z #define SEEK_END 2 2025-05-07T19:46:45.7658867Z #define SEEK_HOLE 4 2025-05-07T19:46:45.7659228Z #define SEEK_SET 0 2025-05-07T19:46:45.7659500Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:45.7659806Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:45.7660133Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:46:45.7660427Z #define SING 2 2025-05-07T19:46:45.7660698Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:45.7660976Z #define STA_CLK 0x8000 2025-05-07T19:46:45.7661268Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:45.7661548Z #define STA_DEL 0x0020 2025-05-07T19:46:45.7661830Z #define STA_FLL 0x0008 2025-05-07T19:46:45.7662096Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:45.7662394Z #define STA_INS 0x0010 2025-05-07T19:46:45.7662672Z #define STA_MODE 0x4000 2025-05-07T19:46:45.7662925Z #define STA_NANO 0x2000 2025-05-07T19:46:45.7663434Z #define STA_PLL 0x0001 2025-05-07T19:46:45.7663685Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:45.7663980Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:45.7664251Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:45.7664559Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:45.7664819Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:45.7665096Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:45.7665639Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:45.7666226Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:45.7666493Z #define TIMER_ABSTIME 1 2025-05-07T19:46:45.7666725Z #define TIME_UTC 1 2025-05-07T19:46:45.7666968Z #define TLOSS 5 2025-05-07T19:46:45.7667187Z #define TMP_MAX 238328 2025-05-07T19:46:45.7667450Z #define TTY_NAME_MAX 32 2025-05-07T19:46:45.7667700Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:46:45.7668027Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:46:45.7668346Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:45.7668728Z #define ULONG_LONG_MAX 
(__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:45.7669093Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:46:45.7669388Z #define UNDERFLOW 4 2025-05-07T19:46:45.7669656Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:46:45.7669938Z #define WCONTINUED 8 2025-05-07T19:46:45.7670188Z #define WEXITED 4 2025-05-07T19:46:45.7670502Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:45.7670989Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:45.7671444Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:45.7671907Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:45.7672395Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:45.7672755Z #define WNOHANG 1 2025-05-07T19:46:45.7673007Z #define WNOWAIT 0x01000000 2025-05-07T19:46:45.7673263Z #define WORD_BIT 32 2025-05-07T19:46:45.7673507Z #define WSTOPPED 2 2025-05-07T19:46:45.7673798Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:45.7674311Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:45.7674855Z #define WUNTRACED 2 2025-05-07T19:46:45.7675135Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:45.7675425Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:45.7675731Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:45.7676049Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:45.7676363Z #define _ACRTIMP 2025-05-07T19:46:45.7676625Z #define _ALLOCA_H 1 2025-05-07T19:46:45.7676866Z #define _ASSERT_H 1 2025-05-07T19:46:45.7677129Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:45.7677397Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:45.7677692Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:45.7678061Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:45.7678381Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:45.7678667Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:45.7678947Z #define _BITS_TIME_H 1 
2025-05-07T19:46:45.7679212Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:45.7679515Z #define _BITS_TYPES_H 1 2025-05-07T19:46:45.7679796Z #define _BSD_SOURCE 1 2025-05-07T19:46:45.7680052Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:45.7680423Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:45.7680696Z #define _CRTIMP 2025-05-07T19:46:45.7680954Z #define _CTYPE_H 1 2025-05-07T19:46:45.7681190Z #define _ENDIAN_H 1 2025-05-07T19:46:45.7681461Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:45.7681745Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:45.7682050Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:45.7682315Z #define _FEATURES_H 1 2025-05-07T19:46:45.7682594Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:45.7682854Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:45.7683183Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:45.7683693Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:45.7684160Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:45.7684504Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:45.7684806Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:45.7685140Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:45.7685449Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:45.7685793Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:45.7686123Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:45.7686498Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:45.7687107Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:45.7687559Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:45.7687853Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:45.7688123Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:45.7688437Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:45.7688752Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:45.7689050Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:45.7689327Z #define 
_GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:45.7689616Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:45.7690016Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:45.7690361Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:45.7690756Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:45.7691045Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:45.7691365Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:45.7691665Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:45.7692036Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:45.7692391Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:45.7692812Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:45.7693291Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:45.7693600Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:45.7693885Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:45.7694147Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:45.7694434Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:45.7694758Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:45.7695056Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:45.7695313Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:45.7695745Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:45.7696069Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:45.7696404Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:45.7696728Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:45.7697026Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:45.7697345Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:45.7697684Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:45.7698095Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:45.7698604Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:45.7699213Z #define 
_GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:45.7699770Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:45.7700087Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:45.7700400Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:45.7700783Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:45.7701131Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:45.7701446Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:45.7701890Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:45.7702330Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:45.7702675Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:45.7703002Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:45.7703300Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:45.7703738Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:45.7704133Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:45.7704478Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:45.7704778Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:45.7705874Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_##_NTYPE : false_type { }; template<typename _Tp> struct __has_##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:46:45.7706975Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:45.7707266Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:45.7707694Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:45.7708010Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:45.7708339Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:45.7708625Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:45.7708948Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:45.7709301Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:45.7709591Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:45.7709893Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:45.7710170Z #define 
_GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:45.7710487Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:45.7710833Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:45.7711189Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:45.7711526Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:45.7711915Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:45.7712282Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:45.7712665Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:45.7713012Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:45.7713321Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:45.7713620Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:45.7713906Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:45.7714319Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:45.7714776Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:45.7715085Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:45.7715428Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:45.7715744Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:45.7716043Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:45.7716360Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:45.7716722Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:45.7717084Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:45.7717413Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:45.7717695Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:45.7717994Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:45.7718278Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:45.7718585Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:45.7718873Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:45.7719182Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:45.7719467Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:45.7719777Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:45.7720173Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:45.7720466Z #define 
_GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:45.7720775Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:45.7721067Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:45.7721382Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:45.7721672Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:45.7721988Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:45.7722281Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:45.7722666Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:45.7722950Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:45.7723261Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:45.7723572Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:45.7723864Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:45.7724175Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:45.7724467Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:45.7724815Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:45.7725120Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:45.7725594Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:45.7725887Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:45.7726196Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:45.7726484Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:45.7726974Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:45.7727291Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:45.7727585Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:45.7727915Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:45.7728206Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:45.7728665Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:45.7729132Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:45.7729536Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:45.7729850Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:45.7730182Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:45.7730510Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:45.7730799Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:45.7731147Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 
2025-05-07T19:46:45.7731479Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:45.7731823Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:45.7732125Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:45.7732450Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:45.7732739Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:45.7733047Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:45.7733331Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:45.7733662Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:45.7733975Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:45.7734264Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:45.7734564Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:45.7734847Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:45.7735151Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:45.7735444Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:45.7735781Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:45.7736101Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:45.7736453Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:45.7736768Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:45.7737073Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:45.7737392Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:45.7737717Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:45.7738065Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:45.7738347Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:45.7738658Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:45.7738962Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:45.7739274Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:45.7739557Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:45.7739868Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:45.7740160Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:45.7740465Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:45.7740876Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:45.7741159Z #define _GLIBCXX_HAVE_SINHL 1 
2025-05-07T19:46:45.7741459Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:45.7741879Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:45.7742209Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:45.7742491Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:45.7742804Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:45.7743211Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:45.7743520Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:45.7743800Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:45.7744190Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:45.7744502Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:45.7744789Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:45.7745096Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:45.7745374Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:45.7745668Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:45.7745963Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:45.7746302Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:45.7746583Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:45.7746954Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:45.7747332Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:45.7747642Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:45.7747945Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:45.7748232Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:45.7748545Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:45.7748828Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:45.7749144Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:45.7749436Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:45.7749739Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:45.7750026Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:45.7750332Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:45.7750635Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:45.7750912Z #define _GLIBCXX_HAVE_S_ISREG 1 
2025-05-07T19:46:45.7751198Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:45.7751464Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:45.7751754Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:45.7752018Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:45.7752310Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:45.7752585Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:45.7752872Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:45.7753155Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:45.7753459Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:45.7753769Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:45.7754110Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:45.7754411Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:45.7754871Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:45.7755188Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:45.7755477Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:45.7755790Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:45.7756093Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:45.7756409Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:45.7756714Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:45.7757007Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:45.7757325Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:45.7757632Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:45.7758179Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:45.7758834Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:45.7759307Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:45.7759604Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:45.7759933Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:45.7760332Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:45.7760887Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:45.7761396Z #define 
_GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:45.7761735Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:45.7762236Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:45.7762823Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:45.7763374Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:45.7763726Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:45.7764115Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:45.7764501Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:45.7764942Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:45.7765370Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:45.7765780Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:45.7766259Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:45.7766802Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:45.7767226Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:45.7767507Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:45.7767859Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:45.7768265Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:45.7768691Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:45.7769032Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:45.7769363Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:45.7769758Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:45.7770056Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:45.7770402Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:45.7770723Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:45.7771011Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:45.7771280Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:45.7771581Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:45.7771874Z 
#define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:45.7772164Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:45.7772471Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:45.7772729Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:45.7773011Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:45.7773265Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:45.7773537Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:45.7773842Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:45.7774237Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:45.7774576Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:45.7774902Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:45.7775275Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:45.7775592Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:45.7775928Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:45.7776235Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:45.7776554Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:45.7776847Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:45.7777194Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:45.7777532Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:45.7777886Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:45.7778214Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:45.7778519Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:45.7778861Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:45.7779175Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:45.7779457Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:45.7779725Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:45.7780029Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:45.7780314Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:45.7780653Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:45.7781043Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 
2025-05-07T19:46:45.7781341Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:45.7781653Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:45.7781940Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:45.7782281Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:45.7782717Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:45.7783087Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:45.7783375Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:45.7783733Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:45.7784124Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:45.7784540Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:45.7785187Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:45.7785505Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:45.7785846Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:45.7786165Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:45.7786483Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:45.7786786Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:45.7787101Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:45.7787393Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:45.7787703Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:45.7788013Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:45.7788300Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:45.7788622Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:45.7788923Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:45.7789243Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:45.7789517Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:45.7789808Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:45.7790095Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:45.7790393Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:45.7790690Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:45.7791032Z #define 
_GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:45.7791372Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:45.7791669Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:45.7791991Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:45.7792309Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:45.7792658Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:45.7792959Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:45.7793292Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:45.7793649Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:45.7794138Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:45.7794614Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:45.7794902Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:45.7795231Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:45.7795573Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:45.7795931Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:45.7796209Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:45.7796614Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:45.7797050Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:45.7797388Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:45.7797674Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:45.7797982Z #define _GNU_SOURCE 1 2025-05-07T19:46:45.7798274Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:45.7798587Z #define _G_BUFSIZ 8192 2025-05-07T19:46:45.7798874Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:45.7799132Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:45.7799482Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:45.7799875Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:45.7800215Z #define _G_config_h 1 2025-05-07T19:46:45.7800472Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:45.7800798Z #define _INITIALIZER_LIST 2025-05-07T19:46:45.7801062Z #define _IOFBF 0 2025-05-07T19:46:45.7801311Z #define _IOLBF 1 2025-05-07T19:46:45.7801535Z #define _IONBF 2 
2025-05-07T19:46:45.7801793Z #define _IOS_APPEND 8 2025-05-07T19:46:45.7802072Z #define _IOS_ATEND 4 2025-05-07T19:46:45.7802308Z #define _IOS_BIN 128 2025-05-07T19:46:45.7802575Z #define _IOS_INPUT 1 2025-05-07T19:46:45.7802818Z #define _IOS_NOCREATE 32 2025-05-07T19:46:45.7803111Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:45.7803378Z #define _IOS_OUTPUT 2 2025-05-07T19:46:45.7803733Z #define _IOS_TRUNC 16 2025-05-07T19:46:45.7803994Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:45.7804360Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:45.7804734Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:45.7805046Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:45.7805363Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:45.7805662Z #define _IO_DEC 020 2025-05-07T19:46:45.7805938Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:45.7806310Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:45.7806720Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:45.7806965Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:45.7807235Z #define _IO_FIXED 010000 2025-05-07T19:46:45.7807489Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:45.7807941Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:45.7808226Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:45.7808556Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:45.7808899Z #define _IO_HEX 0100 2025-05-07T19:46:45.7809149Z #define _IO_INTERNAL 010 2025-05-07T19:46:45.7809433Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:45.7809706Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:45.7810008Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:45.7810273Z #define _IO_LEFT 02 2025-05-07T19:46:45.7810529Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:45.7810787Z #define _IO_LINKED 0x80 2025-05-07T19:46:45.7811064Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:45.7811512Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:45.7811821Z #define _IO_NO_READS 4 2025-05-07T19:46:45.7812074Z #define _IO_NO_WRITES 8 
2025-05-07T19:46:45.7812344Z #define _IO_OCT 040 2025-05-07T19:46:45.7812757Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:45.7813217Z #define _IO_RIGHT 04 2025-05-07T19:46:45.7813495Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:45.7813774Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:45.7814069Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:45.7814339Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:45.7814627Z #define _IO_SKIPWS 01 2025-05-07T19:46:45.7814879Z #define _IO_STDIO 040000 2025-05-07T19:46:45.7815153Z #define _IO_STDIO_H 2025-05-07T19:46:45.7815408Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:45.7815711Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:45.7816003Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:45.7816295Z #define _IO_UNITBUF 020000 2025-05-07T19:46:45.7816587Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:45.7816863Z #define _IO_USER_BUF 1 2025-05-07T19:46:45.7817140Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:45.7817432Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:45.7817793Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:45.7818216Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:45.7818756Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:45.7819207Z #define _IO_file_flags _flags 2025-05-07T19:46:45.7819502Z #define _IO_flockfile(_fp) 2025-05-07T19:46:45.7819815Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:45.7820113Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:45.7820428Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:45.7820717Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:45.7821306Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:45.7821886Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:45.7822192Z #define _IO_off64_t __off64_t 2025-05-07T19:46:45.7822472Z #define _IO_off_t __off_t 2025-05-07T19:46:45.7822797Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:45.7823563Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:45.7824178Z #define _IO_pid_t __pid_t 2025-05-07T19:46:45.7824929Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:45.7825771Z #define _IO_size_t size_t 2025-05-07T19:46:45.7826067Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:45.7826410Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:45.7826787Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:45.7827183Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:45.7827590Z #define _IO_uid_t __uid_t 2025-05-07T19:46:45.7827886Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:45.7828180Z #define _IO_wint_t wint_t 2025-05-07T19:46:45.7828606Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:45.7829040Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:45.7829444Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:45.7829804Z #define _ISbit(bit) ((bit) < 8 ? 
((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:45.7830238Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:45.7830558Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:45.7830845Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:45.7831139Z #define _LINUX_LIMITS_H 2025-05-07T19:46:45.7831401Z #define _LP64 1 2025-05-07T19:46:45.7831662Z #define _MATH_H 1 2025-05-07T19:46:45.7831908Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:45.7832201Z #define _MOVE_H 1 2025-05-07T19:46:45.7832444Z #define _Mfloat_ float 2025-05-07T19:46:45.7832743Z #define _Mlong_double_ long double 2025-05-07T19:46:45.7833039Z #define _NEW 2025-05-07T19:46:45.7833315Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:45.7833655Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:45.7833946Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:45.7834346Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:45.7834636Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:45.7834970Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:45.7835341Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:45.7835666Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:45.7835958Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:45.7836266Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:45.7836553Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:45.7836857Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:45.7837140Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:45.7837413Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:45.7837723Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:45.7838036Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:45.7860585Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:45.7861013Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:45.7861368Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:45.7861673Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:45.7861979Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:45.7862249Z #define _POSIX_LOGIN_NAME_MAX 9 
2025-05-07T19:46:45.7862543Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:45.7862813Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:45.7863105Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:45.7863373Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:45.7863680Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:45.7863943Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:45.7864234Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:45.7864513Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:45.7864775Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:45.7865059Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:45.7865310Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:45.7865600Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:45.7865871Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:45.7866351Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:45.7866653Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:45.7866952Z #define _POSIX_SOURCE 1 2025-05-07T19:46:45.7867227Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:45.7867700Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:45.7868017Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:45.7868299Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:45.7868634Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:45.7869156Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:45.7869499Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:45.7869810Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:45.7870113Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:45.7870398Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:45.7870700Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:45.7871053Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:45.7871599Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:45.7874000Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:46:45.7874631Z #define _PSTL_CONFIG_H 
2025-05-07T19:46:45.7875135Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:45.7876007Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:45.7876867Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:45.7877671Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:45.7878683Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:45.7879475Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:45.7879952Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:45.7880493Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:45.7880968Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:45.7881297Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:45.7881669Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:45.7882160Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:45.7882567Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:45.7882875Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:46:45.7883893Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:45.7884658Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:45.7885100Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:45.7885492Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:45.7885878Z #define _PSTL_PRAGMA_MESSAGE(x) 
2025-05-07T19:46:45.7886524Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:45.7887057Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:45.7887418Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:45.7887751Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:45.7888099Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:45.7888448Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:45.7888825Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:45.7889251Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:45.7889746Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:45.7890202Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:45.7890670Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:45.7891025Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:45.7891347Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:45.7891694Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:45.7891996Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:46:45.7892463Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:45.7892974Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:45.7893376Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:45.7893747Z #define _PSTL_VERSION 12000 2025-05-07T19:46:45.7894061Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:45.7894485Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:45.7894886Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:45.7895246Z #define _PTRDIFF_T 2025-05-07T19:46:45.7895488Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:45.7895831Z #define _SIGSET_H_types 1 2025-05-07T19:46:45.7896195Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 
2025-05-07T19:46:45.7896570Z #define _SIZE_T 2025-05-07T19:46:45.7896829Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:45.7897085Z #define _STDIO_H 1 2025-05-07T19:46:45.7897355Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:45.7897622Z #define _STDLIB_H 1 2025-05-07T19:46:45.7897878Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:45.7898145Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:45.7898482Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:45.7898781Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:45.7899058Z #define _STL_PAIR_H 1 2025-05-07T19:46:45.7899299Z #define _STL_RELOPS_H 1 2025-05-07T19:46:45.7899565Z #define _STRING_H 1 2025-05-07T19:46:45.7899823Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:45.7900083Z #define _SVID_SOURCE 1 2025-05-07T19:46:45.7900349Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:45.7900591Z #define _SYS_SELECT_H 1 2025-05-07T19:46:45.7900878Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:45.7901141Z #define _SYS_TYPES_H 1 2025-05-07T19:46:45.7901409Z #define _TIME_H 1 2025-05-07T19:46:45.7901646Z #define _VA_LIST_DEFINED 2025-05-07T19:46:45.7901924Z #define _XLOCALE_H 1 2025-05-07T19:46:45.7902186Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:45.7902518Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:45.7902795Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:45.7903069Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:45.7903469Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:45.7903936Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:45.7904359Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:45.7904712Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:45.7905063Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:45.7905328Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:45.7905615Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:45.7905877Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:45.7906171Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:45.7906455Z 
#define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:45.7906724Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:45.7907047Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:45.7907340Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:45.7907654Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:45.7907935Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:45.7908232Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:46:45.7908529Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:45.7908842Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:45.7909167Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7909535Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7909880Z #define __BOOL_WIDTH__ 8 2025-05-07T19:46:45.7910143Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:45.7910490Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:45.7910822Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:45.7911148Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:46:45.7911456Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:45.7911871Z #define __CHAR_BIT__ 8 2025-05-07T19:46:45.7912120Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:45.7912451Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:45.7912784Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:45.7913092Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:45.7913411Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:45.7913828Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:45.7914234Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:45.7914712Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:45.7915075Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:45.7915401Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:45.7915744Z #define __CLANG_LIMITS_H 2025-05-07T19:46:45.7916027Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:46:45.7916428Z #define __CLOCKID_T_TYPE 
__S32_TYPE 2025-05-07T19:46:45.7916772Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7917107Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:45.7917417Z #define __COMPAR_FN_T 2025-05-07T19:46:45.7917684Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:45.7917993Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:46:45.7918289Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:45.7918594Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:45.7918876Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:45.7919522Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:45.7920187Z #define __CUDACC__ 1 2025-05-07T19:46:45.7920452Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:45.7920786Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:45.7921256Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:45.7921780Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:45.7922082Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:45.7922477Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_##_FEAT 2025-05-07T19:46:45.7922886Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:45.7923194Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:45.7923495Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:45.7923805Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:45.7924105Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:45.7924386Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:45.7924689Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:45.7924986Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:45.7925316Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:46:45.7925650Z #define __DBL_DIG__ 15 2025-05-07T19:46:45.7925930Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:46:45.7926236Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:45.7926495Z #define __DBL_HAS_INFINITY__ 1 
2025-05-07T19:46:45.7926880Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:45.7927138Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:45.7927561Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:45.7927818Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:45.7928101Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:46:45.7928546Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:45.7929011Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:45.7929306Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:46:45.7929643Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:46:45.7929965Z #define __DELETE_THROW throw() 2025-05-07T19:46:45.7930246Z #define __DEPRECATED 1 2025-05-07T19:46:45.7930507Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7930845Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:45.7931151Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7931484Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:45.7931795Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7932094Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:45.7932373Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:45.7932680Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:45.7932957Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:45.7933241Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:45.7933519Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:45.7933772Z #define __ELF__ 1 2025-05-07T19:46:45.7934001Z #define __END_DECLS } 2025-05-07T19:46:45.7934237Z #define __END_NAMESPACE_C99 2025-05-07T19:46:45.7934616Z #define __END_NAMESPACE_STD 2025-05-07T19:46:45.7934871Z #define __EXCEPTIONS 1 2025-05-07T19:46:45.7935106Z #define __EXCEPTION_H 1 2025-05-07T19:46:45.7935383Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:45.7935802Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:45.7936246Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:45.7936642Z #define 
__FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:45.7937203Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:45.7937659Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:45.7938089Z #define __FD_SETSIZE 1024 2025-05-07T19:46:45.7938770Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:45.7939501Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:45.7939776Z #define __FILE_defined 1 2025-05-07T19:46:45.7940024Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:45.7940292Z #define __FLOAT128__ 1 2025-05-07T19:46:45.7940538Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:45.7940852Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:46:45.7941167Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:46:45.7941605Z #define __FLT16_DIG__ 3 2025-05-07T19:46:45.7941869Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:46:45.7942157Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:46:45.7942430Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:46:45.7942699Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:46:45.7942983Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:46:45.7943241Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:46:45.7943511Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:46:45.7943766Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:46:45.7944053Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:46:45.7944331Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:46:45.7944593Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:46:45.7944883Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:45.7945161Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:46:45.7945455Z #define __FLT_DIG__ 6 2025-05-07T19:46:45.7945693Z #define 
__FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:46:45.7945985Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:45.7946247Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:45.7946522Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:45.7946776Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:45.7947042Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:45.7947400Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:45.7947630Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:46:45.7947900Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:45.7948150Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:45.7948416Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:46:45.7948665Z #define __FLT_RADIX__ 2 2025-05-07T19:46:45.7948914Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:45.7949218Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:45.7949529Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:45.7949832Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:45.7950166Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:45.7950482Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7950762Z #define __FXSR__ 1 2025-05-07T19:46:45.7950996Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:45.7951257Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:45.7951549Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:45.7951835Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:45.7952141Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:45.7952418Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:45.7952703Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:45.7953065Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:45.7953348Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:45.7953639Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:45.7953932Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:45.7954315Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 
2025-05-07T19:46:45.7954771Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:45.7955096Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:45.7955488Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:45.7955827Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:45.7956155Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:45.7956485Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:45.7956771Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:45.7957071Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:45.7957372Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:45.7957643Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:45.7957934Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:45.7958346Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:45.7958806Z #define __GLIBC__ 2 2025-05-07T19:46:45.7959042Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:46:45.7959311Z #define __GNUC_MINOR__ 2 2025-05-07T19:46:45.7959572Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:46:45.7959973Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:45.7960426Z #define __GNUC_VA_LIST 2025-05-07T19:46:45.7960662Z #define __GNUC__ 4 2025-05-07T19:46:45.7960885Z #define __GNUG__ 4 2025-05-07T19:46:45.7961110Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:45.7961370Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:46:45.7961645Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:45.7961936Z #define __GXX_RTTI 1 2025-05-07T19:46:45.7962158Z #define __GXX_WEAK__ 1 2025-05-07T19:46:45.7962406Z #define __HAVE_COLUMN 2025-05-07T19:46:45.7962660Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:45.7962920Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:45.7963189Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:45.7963461Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:45.7963760Z #define __INO_T_MATCHES_INO64_T 1 
2025-05-07T19:46:45.7964047Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:45.7964355Z #define __INT16_C_SUFFIX__ 2025-05-07T19:46:45.7964614Z #define __INT16_FMTd__ "hd" 2025-05-07T19:46:45.7964879Z #define __INT16_FMTi__ "hi" 2025-05-07T19:46:45.7965128Z #define __INT16_MAX__ 32767 2025-05-07T19:46:45.7965396Z #define __INT16_TYPE__ short 2025-05-07T19:46:45.7965663Z #define __INT32_C_SUFFIX__ 2025-05-07T19:46:45.7965912Z #define __INT32_FMTd__ "d" 2025-05-07T19:46:45.7966174Z #define __INT32_FMTi__ "i" 2025-05-07T19:46:45.7966424Z #define __INT32_MAX__ 2147483647 2025-05-07T19:46:45.7966806Z #define __INT32_TYPE__ int 2025-05-07T19:46:45.7967044Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:46:45.7967295Z #define __INT64_FMTd__ "ld" 2025-05-07T19:46:45.7967534Z #define __INT64_FMTi__ "li" 2025-05-07T19:46:45.7967793Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7968073Z #define __INT64_TYPE__ long int 2025-05-07T19:46:45.7968334Z #define __INT8_C_SUFFIX__ 2025-05-07T19:46:45.7968563Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:46:45.7968811Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:46:45.7969054Z #define __INT8_MAX__ 127 2025-05-07T19:46:45.7969291Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:45.7969577Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:46:45.7969823Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:46:45.7970073Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:46:45.7970326Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7970618Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:45.7970866Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:45.7971117Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:46:45.7971359Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:46:45.7971629Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7971999Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:45.7972259Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:45.7972514Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:46:45.7972771Z 
#define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:46:45.7973050Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:46:45.7973148Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:46:45.7973242Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:46:45.7973356Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:46:45.7973521Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:46:45.7973614Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:46:45.7973702Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:46:45.7973809Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:46:45.7973906Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:46:45.7973996Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:46:45.7974120Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7974216Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:45.7974314Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:45.7974421Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:46:45.7974514Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:46:45.7974611Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:46:45.7974715Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:45.7974818Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:45.7974916Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:46:45.7975010Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:46:45.7975114Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:46:45.7975205Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:46:45.7975299Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:45.7975388Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:46:45.7975488Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:46:45.7975582Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:46:45.7975674Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:45.7975773Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:45.7975862Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:46:45.7975959Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:46:45.7976076Z #define __INT_LEAST64_MAX__ 
9223372036854775807L 2025-05-07T19:46:45.7976181Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:45.7976271Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:45.7976361Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:46:45.7976457Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:46:45.7976544Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:46:45.7976640Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:45.7976755Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:45.7976844Z #define __INT_MAX__ 2147483647 2025-05-07T19:46:45.7976932Z #define __INT_WIDTH__ 32 2025-05-07T19:46:45.7977023Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:45.7977132Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:45.7977217Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:45.7977359Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:46:45.7977464Z #define __LDBL_DIG__ 18 2025-05-07T19:46:45.7977587Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:46:45.7977677Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:45.7977767Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:45.7977874Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:45.7977963Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:45.7978049Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:45.7978149Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:45.7978264Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:46:45.7978356Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:45.7978453Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:45.7978560Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:46:45.7978665Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:45.7978788Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:45.7978968Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:45.7979108Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:45.7979246Z #define 
__LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:45.7979334Z #define __LEAF 2025-05-07T19:46:45.7979413Z #define __LEAF_ATTR 2025-05-07T19:46:45.7979499Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:45.7979585Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:45.7979691Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:46:45.7979781Z #define __LLONG_WIDTH__ 64 2025-05-07T19:46:45.7979936Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:46:45.7980048Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:45.7980150Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7980236Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:45.7980317Z #define __LP64__ 1 2025-05-07T19:46:45.7980644Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:45.7981265Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:45.7981358Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:45.7981456Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7981549Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:45.7981632Z #define __MMX__ 1 2025-05-07T19:46:45.7981730Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:45.7981817Z #define __N(msgid) (msgid) 2025-05-07T19:46:45.7981946Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:45.7982053Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:45.7982148Z #define __NO_CTYPE 1 2025-05-07T19:46:45.7982234Z #define __NO_INLINE__ 1 2025-05-07T19:46:45.7982327Z #define __NO_MATH_INLINES 1 2025-05-07T19:46:45.7982438Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:45.7982538Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:45.7982618Z #define __NVCC__ 1 2025-05-07T19:46:45.7982711Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:46:45.7982817Z #define 
__NV_LEGACY_LAUNCH 1 2025-05-07T19:46:45.7982909Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:45.7982997Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:46:45.7983103Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:45.7983193Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:45.7983295Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7983409Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:46:45.7983523Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:46:45.7983624Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:46:45.7983734Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:46:45.7983848Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:46:45.7983937Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:45.7984030Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:45.7984125Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:45.7984229Z #define __P(args) args 2025-05-07T19:46:45.7984314Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:45.7984399Z #define __PIC__ 2 2025-05-07T19:46:45.7984504Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:45.7984580Z #define __PIE__ 2 2025-05-07T19:46:45.7984663Z #define __PMT(args) args 2025-05-07T19:46:45.7984754Z #define __POINTER_WIDTH__ 64 2025-05-07T19:46:45.7984868Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:45.7984958Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:45.7985067Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:45.7985168Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:45.7985255Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:46:45.7985342Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:46:45.7985439Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:46:45.7985537Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:45.7985622Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:45.7985826Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:45.7986076Z #define 
__REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:45.7986315Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:45.7986566Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:45.7986803Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:45.7986896Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:45.7987032Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:45.7987139Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:45.7987249Z #define __S16_TYPE short int 2025-05-07T19:46:45.7987336Z #define __S32_TYPE int 2025-05-07T19:46:45.7987428Z #define __S64_TYPE long int 2025-05-07T19:46:45.7987523Z #define __SCHAR_MAX__ 127 2025-05-07T19:46:45.7987607Z #define __SEG_FS 1 2025-05-07T19:46:45.7987851Z #define __SEG_GS 1 2025-05-07T19:46:45.7987939Z #define __SHRT_MAX__ 32767 2025-05-07T19:46:45.7988031Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:45.7988128Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:46:45.7988220Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:45.7988320Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:45.7988416Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:45.7988507Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:45.7988598Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:45.7988699Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:45.7988798Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:45.7988895Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:45.7988992Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:45.7989089Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:45.7989194Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:45.7989300Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:45.7989414Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:45.7989520Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:45.7989624Z #define 
__SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:45.7989738Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:45.7989838Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:45.7989943Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:45.7990053Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:45.7990147Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:45.7990240Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:45.7990334Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:45.7990432Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:45.7990525Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:45.7990619Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:46:45.7990724Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:46:45.7990810Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:46:45.7990898Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:46:45.7991003Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:46:45.7991118Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:45.7991203Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:45.7991295Z #define __SLONG32_TYPE int 2025-05-07T19:46:45.7991402Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:45.7991506Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7991609Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:45.7991705Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:45.7991810Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:45.7991903Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:45.7991993Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:45.7992110Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7992212Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:45.7992307Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:45.7992394Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:45.7992508Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:45.7992599Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:45.7992698Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:45.7992808Z #define 
__SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:45.7992902Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:45.7992993Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:45.7993134Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:45.7993235Z #define __SM_70_RT_H__ 2025-05-07T19:46:45.7993322Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:45.7993410Z #define __SM_80_RT_H__ 2025-05-07T19:46:45.7993519Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:45.7993599Z #define __SM_90_RT_H__ 2025-05-07T19:46:45.7993698Z #define __SQUAD_TYPE long int 2025-05-07T19:46:45.7993786Z #define __SSE2_MATH__ 1 2025-05-07T19:46:45.7993928Z #define __SSE2__ 1 2025-05-07T19:46:45.7994079Z #define __SSE_MATH__ 1 2025-05-07T19:46:45.7994169Z #define __SSE__ 1 2025-05-07T19:46:45.7994287Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:45.7994414Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:46:45.7994690Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:45.7994789Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:45.7994906Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:45.7995004Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:45.7995139Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:45.7995249Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:45.7995344Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:45.7995433Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:45.7995526Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:45.7995626Z #define __STDC__ 1 2025-05-07T19:46:45.7995710Z #define __STDDEF_H 2025-05-07T19:46:45.7995793Z #define __STRING(x) #x 2025-05-07T19:46:45.7995920Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:45.7996021Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:45.7996146Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7996245Z #define __SWORD_TYPE long int 2025-05-07T19:46:45.7996373Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:45.7996498Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:45.7996594Z 
#define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:45.7996715Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:45.7996813Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:45.7996905Z #define __THROW throw () 2025-05-07T19:46:45.7996994Z #define __THROWNL throw () 2025-05-07T19:46:45.7997105Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:45.7997215Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:45.7997317Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:45.7997421Z #define __U32_TYPE unsigned int 2025-05-07T19:46:45.7997522Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:45.7997613Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:45.7997711Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:46:45.7997815Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:46:45.7997911Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:46:45.7998002Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:46:45.7998105Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:46:45.7998196Z #define __UINT16_MAX__ 65535 2025-05-07T19:46:45.7998299Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:46:45.7998387Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:46:45.7998486Z #define __UINT32_FMTX__ "X" 2025-05-07T19:46:45.7998579Z #define __UINT32_FMTo__ "o" 2025-05-07T19:46:45.7998667Z #define __UINT32_FMTu__ "u" 2025-05-07T19:46:45.7998762Z #define __UINT32_FMTx__ "x" 2025-05-07T19:46:45.7998856Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:46:45.7998953Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:45.7999049Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:46:45.7999152Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:46:45.7999249Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:46:45.7999344Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:46:45.7999452Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:46:45.7999565Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:46:45.7999679Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:45.7999768Z #define __UINT8_C_SUFFIX__ 
2025-05-07T19:46:45.7999872Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:46:45.7999963Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:46:45.8000050Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:46:45.8000146Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:46:45.8000287Z #define __UINT8_MAX__ 255 2025-05-07T19:46:45.8000389Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:45.8000483Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:46:45.8000581Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:46:45.8000672Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:46:45.8000762Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:46:45.8000867Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:46:45.8000984Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:46:45.8001159Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:45.8001259Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:46:45.8001351Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:46:45.8001450Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:46:45.8001549Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:46:45.8001656Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:46:45.8001766Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:46:45.8001880Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:45.8001992Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:46:45.8002088Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:46:45.8002185Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:46:45.8002281Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:46:45.8002391Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:46:45.8002486Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:46:45.8002598Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:46:45.8002706Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:46:45.8002805Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:46:45.8002897Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:46:45.8002993Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:46:45.8003104Z #define __UINT_FAST32_MAX__ 
4294967295U 2025-05-07T19:46:45.8003211Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:46:45.8003305Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:46:45.8003407Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:46:45.8003501Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:46:45.8003597Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:46:45.8003715Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:45.8003847Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:45.8003941Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:46:45.8004030Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:46:45.8004134Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:46:45.8004225Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:46:45.8004321Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:46:45.8004422Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:45.8004526Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:46:45.8004619Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:46:45.8004717Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:46:45.8004818Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:46:45.8004912Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:46:45.8005022Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:46:45.8005132Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:46:45.8005228Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:46:45.8005321Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:46:45.8005417Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:46:45.8005537Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:46:45.8005645Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:45.8005742Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:46:45.8005849Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:46:45.8005948Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:46:45.8006042Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:46:45.8006160Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 
2025-05-07T19:46:45.8006417Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:45.8006527Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:46:45.8006631Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:46:45.8006758Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:46:45.8006858Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:46:45.8007007Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:46:45.8007122Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:45.8007250Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:45.8007367Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:45.8007474Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:45.8007598Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:45.8007699Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:45.8007837Z #define __USE_ANSI 1 2025-05-07T19:46:45.8007931Z #define __USE_ATFILE 1 2025-05-07T19:46:45.8008044Z #define __USE_BSD 1 2025-05-07T19:46:45.8008143Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:45.8008231Z #define __USE_GNU 1 2025-05-07T19:46:45.8008346Z #define __USE_ISOC11 1 2025-05-07T19:46:45.8008438Z #define __USE_ISOC95 1 2025-05-07T19:46:45.8008530Z #define __USE_ISOC99 1 2025-05-07T19:46:45.8008628Z #define __USE_ISOCXX11 1 2025-05-07T19:46:45.8008745Z #define __USE_LARGEFILE 1 2025-05-07T19:46:45.8008846Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:45.8008941Z #define __USE_MISC 1 2025-05-07T19:46:45.8009057Z #define __USE_POSIX 1 2025-05-07T19:46:45.8009155Z #define __USE_POSIX199309 1 2025-05-07T19:46:45.8009254Z #define __USE_POSIX199506 1 2025-05-07T19:46:45.8009347Z #define __USE_POSIX2 1 2025-05-07T19:46:45.8009458Z #define __USE_SVID 1 2025-05-07T19:46:45.8009552Z #define __USE_UNIX98 1 2025-05-07T19:46:45.8009646Z #define __USE_XOPEN 1 2025-05-07T19:46:45.8009768Z #define __USE_XOPEN2K 1 2025-05-07T19:46:45.8009867Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:45.8009966Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:45.8010064Z #define 
__USE_XOPEN2KXSI 1 2025-05-07T19:46:45.8010185Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:45.8010289Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:45.8010393Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:45.8010519Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:45.8010623Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:45.8010722Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:45.8010824Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:45.8011278Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:45.8011568Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:45.8011672Z #define __WAIT_STATUS void * 2025-05-07T19:46:45.8011841Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:45.8011939Z #define __WALL 0x40000000 2025-05-07T19:46:45.8012048Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:46:45.8012149Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:45.8012270Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:45.8012371Z #define __WCLONE 0x80000000 2025-05-07T19:46:45.8012514Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:45.8012636Z #define __WCOREFLAG 0x80 2025-05-07T19:46:45.8012790Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:45.8012954Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:45.8013124Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:45.8013353Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:45.8013506Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:45.8013609Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:46:45.8013738Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:45.8013840Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:46:45.8013943Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:45.8014067Z #define __WNOTHREAD 0x20000000 
2025-05-07T19:46:45.8014163Z #define __WORDSIZE 64 2025-05-07T19:46:45.8014272Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:45.8014409Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:45.8014554Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:45.8014656Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:45.8014790Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:45.8014932Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:45.8015083Z #define ____FILE_defined 1 2025-05-07T19:46:45.8015188Z #define ____mbstate_t_defined 1 2025-05-07T19:46:45.8015319Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:45.8015534Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:45.8015625Z #define __amd64 1 2025-05-07T19:46:45.8015718Z #define __amd64__ 1 2025-05-07T19:46:45.8015861Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:45.8016024Z #define __attribute_artificial__ 2025-05-07T19:46:45.8016174Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:45.8016386Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:45.8016592Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:45.8016851Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:45.8017010Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:45.8017206Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:45.8017349Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:45.8017489Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:45.8017751Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:45.8017854Z #define __blkcnt_t_defined 2025-05-07T19:46:45.8017961Z #define __blksize_t_defined 
2025-05-07T19:46:45.8018186Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:45.8018326Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:45.8018418Z #define __bounded 2025-05-07T19:46:45.8019034Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:45.8019550Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:45.8020025Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:45.8020313Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:45.8020655Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:45.8021611Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:45.8021754Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:45.8021858Z #define __catch(X) catch(X) 2025-05-07T19:46:45.8021951Z #define __cdecl 2025-05-07T19:46:45.8022052Z #define __clang__ 1 2025-05-07T19:46:45.8022199Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:46:45.8022304Z #define __clang_major__ 16 
2025-05-07T19:46:45.8022409Z #define __clang_minor__ 0 2025-05-07T19:46:45.8022538Z #define __clang_patchlevel__ 6 2025-05-07T19:46:45.8022965Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:45.8023100Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:46:45.8023223Z #define __clock_t_defined 1 2025-05-07T19:46:45.8023327Z #define __clockid_t_defined 1 2025-05-07T19:46:45.8023530Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:45.8023682Z #define __code_model_small__ 1 2025-05-07T19:46:45.8023825Z #define __constant__ __location__(constant) 2025-05-07T19:46:45.8023934Z #define __cplusplus 201703L 2025-05-07T19:46:45.8024050Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:45.8024183Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:45.8024296Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:45.8024403Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:45.8024562Z #define __cpp_attributes 200809L 2025-05-07T19:46:45.8024688Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:45.8024791Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:45.8024896Z #define __cpp_constexpr 201603L 2025-05-07T19:46:45.8025022Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:46:45.8025117Z #define __cpp_decltype 200707L 2025-05-07T19:46:45.8025220Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:45.8025328Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:45.8025461Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:45.8025570Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:45.8025684Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:45.8025800Z #define __cpp_exceptions 199711L 2025-05-07T19:46:45.8025903Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:45.8026007Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:45.8026251Z #define 
__cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:45.8026346Z #define __cpp_hex_float 201603L 2025-05-07T19:46:45.8026447Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:45.8026559Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:46:45.8026690Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:45.8026789Z #define __cpp_init_captures 201304L 2025-05-07T19:46:45.8026891Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:45.8027009Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:45.8027100Z #define __cpp_lambdas 200907L 2025-05-07T19:46:45.8027213Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:45.8027321Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:45.8027426Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:45.8027521Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:45.8027626Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:45.8027936Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:45.8028026Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:45.8028128Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:45.8028257Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:45.8028504Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:45.8028597Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:45.8028856Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:45.8028976Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:45.8029080Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:45.8029184Z #define __cpp_lib_launder 201606 2025-05-07T19:46:45.8029294Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:45.8029437Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:45.8029565Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:45.8029673Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:45.8029824Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 
2025-05-07T19:46:45.8029970Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:45.8030077Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:45.8030198Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:45.8030352Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:45.8030449Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:45.8030571Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:46:45.8030709Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:45.8030850Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:45.8030970Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:45.8031573Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:45.8031723Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:45.8031818Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:45.8031921Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:45.8032036Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:45.8032142Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:45.8032254Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:45.8032436Z #define __cpp_rtti 199711L 2025-05-07T19:46:45.8032545Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:45.8032650Z #define __cpp_static_assert 201411L 2025-05-07T19:46:45.8032775Z #define __cpp_static_call_operator 202207L 2025-05-07T19:46:45.8032889Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:45.8032995Z #define __cpp_template_auto 201606L 2025-05-07T19:46:45.8033116Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:45.8033240Z #define __cpp_unicode_characters 200704L 2025-05-07T19:46:45.8033350Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:45.8033469Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:45.8033594Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:45.8033705Z #define __cpp_variadic_templates 200704L 
2025-05-07T19:46:45.8033810Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:45.8033928Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:45.8034129Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:45.8034245Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:45.8034376Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:45.8034508Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:45.8034611Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:45.8034712Z #define __cudaCDP2EventRecord 2025-05-07T19:46:45.8034860Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:45.8035005Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:45.8035116Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:45.8035211Z #define __cudaCDP2Free 2025-05-07T19:46:45.8035344Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:45.8035442Z #define __cudaCDP2GetDevice 2025-05-07T19:46:45.8035548Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:45.8035651Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:45.8035766Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:45.8035870Z #define __cudaCDP2GetLastError 2025-05-07T19:46:45.8035988Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:45.8036126Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:45.8036227Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:45.8036333Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:45.8036447Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:45.8036570Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:45.8036666Z #define __cudaCDP2Malloc 2025-05-07T19:46:45.8036770Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:45.8036898Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:45.8036999Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:45.8037119Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:45.8037227Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:45.8037352Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:45.8037460Z #define 
__cudaCDP2Memset2DAsync 2025-05-07T19:46:45.8037573Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:45.8037698Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:45.8037806Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:45.8037913Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:45.8038030Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:45.8038242Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:45.8038507Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:45.8038618Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:45.8038741Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:45.8038859Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:45.8038964Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:45.8039146Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:45.8039265Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:45.8039368Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:45.8039467Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:45.8039581Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:45.8039689Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:45.8039792Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:45.8040012Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:45.8040113Z #define __daddr_t_defined 2025-05-07T19:46:45.8040205Z #define __dev_t_defined 2025-05-07T19:46:45.8040310Z #define __device__ __location__(device) 2025-05-07T19:46:45.8040466Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:45.8040710Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:45.8040944Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:45.8041105Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:46:45.8041243Z #define __exctype(name) extern int name (int) __THROW 
2025-05-07T19:46:45.8041427Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:45.8041524Z #define __export__ 2025-05-07T19:46:45.8041775Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:45.8041984Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:45.8042075Z #define __flexarr [] 2025-05-07T19:46:45.8042273Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:45.8042493Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:45.8042594Z #define __fsblkcnt_t_defined 2025-05-07T19:46:45.8042699Z #define __fsfilcnt_t_defined 2025-05-07T19:46:45.8042791Z #define __gid_t_defined 2025-05-07T19:46:45.8042955Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:45.8043118Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:45.8043362Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:45.8043472Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:45.8043596Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:45.8043729Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:45.8043862Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:45.8044233Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:45.8044461Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:45.8044637Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:45.8044745Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:45.8044867Z #define __glibcxx_integral_traps true 2025-05-07T19:46:45.8045184Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? 
(((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:45.8045440Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:45.8045662Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:45.8045815Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:45.8046023Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:45.8046250Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:45.8046387Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:45.8046542Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:45.8046686Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:45.8046847Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:45.8047176Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:45.8047351Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:45.8047508Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:45.8047614Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:45.8047791Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:45.8048009Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:45.8048255Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:45.8048470Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:45.8048586Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:45.8048754Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:45.8048920Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:45.8049120Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 
2025-05-07T19:46:45.8049244Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:45.8049376Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:45.8049484Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:45.8049618Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:45.8049752Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:45.8049853Z #define __global__ __location__(global) 2025-05-07T19:46:45.8049941Z #define __gnu_linux__ 1 2025-05-07T19:46:45.8050077Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:45.8050178Z #define __have_pthread_attr_t 1 2025-05-07T19:46:45.8050273Z #define __host__ __location__(host) 2025-05-07T19:46:45.8050361Z #define __id_t_defined 2025-05-07T19:46:45.8050452Z #define __import__ 2025-05-07T19:46:45.8050587Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:45.8050676Z #define __ino64_t_defined 2025-05-07T19:46:45.8050769Z #define __ino_t_defined 2025-05-07T19:46:45.8050856Z #define __int8_t_defined 2025-05-07T19:46:45.8051064Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:45.8051206Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:45.8051351Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:45.8051448Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:45.8051581Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:45.8051730Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:45.8051868Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:45.8052121Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:45.8052263Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:45.8052400Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:45.8052591Z #define 
__isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:45.8052725Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:45.8052872Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:45.8053011Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:45.8053154Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:45.8053307Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:45.8053461Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:45.8053546Z #define __k8 1 2025-05-07T19:46:45.8053643Z #define __k8__ 1 2025-05-07T19:46:45.8053733Z #define __key_t_defined 2025-05-07T19:46:45.8053921Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:45.8054014Z #define __ldiv_t_defined 1 2025-05-07T19:46:45.8054107Z #define __linux 1 2025-05-07T19:46:45.8054187Z #define __linux__ 1 2025-05-07T19:46:45.8054276Z #define __lldiv_t_defined 1 2025-05-07T19:46:45.8054408Z #define __llvm__ 1 2025-05-07T19:46:45.8054509Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:45.8054612Z #define __long_double_t long double 2025-05-07T19:46:45.8054715Z #define __malloc_and_calloc_defined 2025-05-07T19:46:45.8054822Z #define __managed__ __location__(managed) 2025-05-07T19:46:45.8054944Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:45.8055029Z #define __mode_t_defined 2025-05-07T19:46:45.8055177Z #define __need_IOV_MAX 2025-05-07T19:46:45.8055266Z #define __need_clockid_t 2025-05-07T19:46:45.8055357Z #define __nlink_t_defined 2025-05-07T19:46:45.8055475Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:45.8055602Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:45.8055764Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:45.8055865Z #define __nv_pure__ __location__(nv_pure) 
2025-05-07T19:46:45.8055963Z #define __off64_t_defined 2025-05-07T19:46:45.8056050Z #define __off_t_defined 2025-05-07T19:46:45.8056129Z #define __pic__ 2 2025-05-07T19:46:45.8056212Z #define __pid_t_defined 2025-05-07T19:46:45.8056307Z #define __pie__ 2 2025-05-07T19:46:45.8056399Z #define __private_extern__ extern 2025-05-07T19:46:45.8056480Z #define __ptr_t void * 2025-05-07T19:46:45.8056565Z #define __ptrvalue 2025-05-07T19:46:45.8056650Z #define __restrict_arr 2025-05-07T19:46:45.8056780Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:46:45.8056909Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:46:45.8057018Z #define __shared__ __location__(shared) 2025-05-07T19:46:45.8057103Z #define __sigset_t_defined 2025-05-07T19:46:45.8057198Z #define __specialization_static 2025-05-07T19:46:45.8057300Z #define __ssize_t_defined 2025-05-07T19:46:45.8057380Z #define __stub_bdflush 2025-05-07T19:46:45.8057462Z #define __stub_chflags 2025-05-07T19:46:45.8057553Z #define __stub_fattach 2025-05-07T19:46:45.8057649Z #define __stub_fchflags 2025-05-07T19:46:45.8057737Z #define __stub_fdetach 2025-05-07T19:46:45.8057820Z #define __stub_getmsg 2025-05-07T19:46:45.8057916Z #define __stub_gtty 2025-05-07T19:46:45.8057996Z #define __stub_lchmod 2025-05-07T19:46:45.8058079Z #define __stub_putmsg 2025-05-07T19:46:45.8058162Z #define __stub_revoke 2025-05-07T19:46:45.8058262Z #define __stub_setlogin 2025-05-07T19:46:45.8058346Z #define __stub_sigreturn 2025-05-07T19:46:45.8058426Z #define __stub_sstk 2025-05-07T19:46:45.8058527Z #define __stub_stty 2025-05-07T19:46:45.8058615Z #define __suseconds_t_defined 2025-05-07T19:46:45.8058701Z #define __thread__ __thread 2025-05-07T19:46:45.8058802Z #define __throw_exception_again throw 2025-05-07T19:46:45.8058907Z #define __time_t_defined 1 2025-05-07T19:46:45.8058992Z #define __timer_t_defined 1 2025-05-07T19:46:45.8059085Z #define __timespec_defined 1 2025-05-07T19:46:45.8059199Z #define __toascii(c) 
((c) & 0x7f) 2025-05-07T19:46:45.8059304Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:45.8059834Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:45.8059931Z #define __try try 2025-05-07T19:46:45.8060019Z #define __tune_k8__ 1 2025-05-07T19:46:45.8060103Z #define __u_char_defined 2025-05-07T19:46:45.8060355Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:45.8060462Z #define __uid_t_defined 2025-05-07T19:46:45.8060542Z #define __unbounded 2025-05-07T19:46:45.8060621Z #define __unix 1 2025-05-07T19:46:45.8060717Z #define __unix__ 1 2025-05-07T19:46:45.8060808Z #define __useconds_t_defined 2025-05-07T19:46:45.8060890Z #define __warnattr(msg) 2025-05-07T19:46:45.8061020Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:46:45.8061110Z #define __wur 2025-05-07T19:46:45.8061190Z #define __x86_64 1 2025-05-07T19:46:45.8061329Z #define __x86_64__ 1 2025-05-07T19:46:45.8061504Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:45.8061666Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:45.8061780Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:45.8062104Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:45.8062540Z #define assert_perror(errnum) (!(errnum) ? 
__ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:45.8062638Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:45.8062728Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:45.8062836Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:45.8062951Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:45.8063048Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:45.8063149Z #define cudaArrayDefault 0x00 2025-05-07T19:46:45.8063263Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:45.8063358Z #define cudaArrayLayered 0x01 2025-05-07T19:46:45.8063452Z #define cudaArraySparse 0x40 2025-05-07T19:46:45.8063610Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:45.8063724Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:45.8063833Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:45.8064017Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:45.8064190Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:45.8064294Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:45.8064401Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:45.8064522Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:45.8064623Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:45.8064723Z #define cudaDeviceMask 0xff 2025-05-07T19:46:45.8064838Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:45.8064959Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:45.8065066Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:45.8065169Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:45.8065286Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:45.8065389Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:45.8065497Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:45.8065600Z #define cudaEventDefault 0x00 2025-05-07T19:46:45.8065703Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:45.8065805Z #define cudaEventInterprocess 0x04 
2025-05-07T19:46:45.8065921Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:45.8066028Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:45.8066125Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:45.8066227Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:45.8066354Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:45.8066551Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:45.8066733Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:45.8066925Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:45.8067045Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:45.8067194Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:45.8067329Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:45.8067444Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:45.8067544Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:45.8067646Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:45.8067775Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:45.8067877Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:45.8067983Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:45.8068087Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:45.8068205Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:45.8068307Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:45.8068421Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:45.8068537Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:45.8068797Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:45.8068934Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:45.8069112Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:45.8069432Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:45.8069727Z #define cudaKernelNodeAttributeClusterDimension 
cudaLaunchAttributeClusterDimension 2025-05-07T19:46:45.8070269Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:45.8070535Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:45.8070929Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:45.8071194Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:45.8071504Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:45.8071948Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:45.8072164Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:45.8072273Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:45.8072370Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:45.8072467Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:45.8072585Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:45.8072684Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:45.8072783Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:45.8072919Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:45.8073031Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:45.8073368Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:45.8073493Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:45.8073656Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:45.8073948Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:45.8074265Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:45.8074730Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 
2025-05-07T19:46:45.8074942Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:45.8075297Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:45.8075402Z #define cudaStreamDefault 0x00 2025-05-07T19:46:45.8075555Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:45.8075826Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:45.8076048Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:45.8076329Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:45.8076541Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:45.8076664Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:45.8076790Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:45.8076922Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:45.8077068Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:45.8077175Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:45.8077308Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:45.8077415Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:45.8077530Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:45.8077646Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:45.8077758Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:45.8077886Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:45.8077995Z #define cudaTextureType1D 0x01 2025-05-07T19:46:45.8078123Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:45.8078226Z #define cudaTextureType2D 0x02 2025-05-07T19:46:45.8078391Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:45.8078510Z #define cudaTextureType3D 0x03 2025-05-07T19:46:45.8078618Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:45.8078748Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:45.8079091Z #define 
cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:45.8079209Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:45.8079352Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:45.8079450Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:45.8079566Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:45.8079655Z #define htole16(x) (x) 2025-05-07T19:46:45.8079746Z #define htole32(x) (x) 2025-05-07T19:46:45.8079837Z #define htole64(x) (x) 2025-05-07T19:46:45.8079980Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:45.8080100Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:45.8080196Z #define isascii(c) __isascii (c) 2025-05-07T19:46:45.8080338Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:45.8080450Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:45.8080570Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:45.8080711Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:45.8080829Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:45.8080943Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:45.8081063Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:45.8081200Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:45.8081310Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:45.8081427Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:45.8081561Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:45.8081657Z #define le16toh(x) (x) 2025-05-07T19:46:45.8081747Z #define le32toh(x) (x) 2025-05-07T19:46:45.8081839Z #define le64toh(x) (x) 2025-05-07T19:46:45.8081938Z #define linux 1 2025-05-07T19:46:45.8082046Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:45.8082182Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:45.8082354Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:45.8082467Z 
#define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:45.8082588Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:46:45.8082716Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:45.8082814Z #define stderr stderr 2025-05-07T19:46:45.8082907Z #define stdin stdin 2025-05-07T19:46:45.8082993Z #define stdout stdout 2025-05-07T19:46:45.8083513Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:45.8084078Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:45.8084184Z #define toascii(c) __toascii (c) 2025-05-07T19:46:45.8084317Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:45.8084404Z #define unix 1 2025-05-07T19:46:45.8084545Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:45.8084685Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:45.8084805Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:45.8084928Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:45.8085060Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:45.8085068Z 2025-05-07T19:46:45.8312590Z 2025-05-07T19:46:45.8312821Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:45.8312830Z 2025-05-07T19:46:47.4231324Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:47.4231711Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:47.4232076Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:47.4232407Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:47.4233041Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:47.4233281Z 2025-05-07T19:46:47.4826103Z 2025-05-07T19:46:47.4839033Z which: no nvidia-smi in 
(CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:47.4839795Z [CHECK] nvidia-smi not found 2025-05-07T19:46:47.4840138Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:47.4940219Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:47.4940857Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:47.4941620Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:47.4941943Z env: 2025-05-07T19:46:47.4942170Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:47.4942462Z BUILD_ENV: build_binary 2025-05-07T19:46:47.4942705Z BUILD_TARGET: default 2025-05-07T19:46:47.4942923Z BUILD_VARIANT: cuda 2025-05-07T19:46:47.4943156Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:47.4943389Z ##[endgroup] 2025-05-07T19:46:47.9822201Z ################################################################################ 2025-05-07T19:46:47.9822644Z # Install PyTorch (PIP) 2025-05-07T19:46:47.9822910Z # 2025-05-07T19:46:47.9835106Z # [2025-05-07T19:46:47.983Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:47.9835688Z ################################################################################ 2025-05-07T19:46:47.9835947Z 2025-05-07T19:46:47.9863375Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:48.9008974Z Channels: 2025-05-07T19:46:48.9010046Z - conda-forge 2025-05-07T19:46:48.9010357Z Platform: linux-64 2025-05-07T19:46:51.9356461Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:53.6176015Z Solving environment: \ | / - done 2025-05-07T19:46:53.9223307Z 2025-05-07T19:46:53.9224058Z ## Package Plan ## 2025-05-07T19:46:53.9224761Z 2025-05-07T19:46:53.9225403Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:53.9226330Z 2025-05-07T19:46:53.9226627Z added / 
updated specs: 2025-05-07T19:46:53.9227384Z - numpy 2025-05-07T19:46:53.9227734Z 2025-05-07T19:46:53.9227746Z 2025-05-07T19:46:53.9228104Z The following packages will be downloaded: 2025-05-07T19:46:53.9229245Z 2025-05-07T19:46:53.9229638Z package | build 2025-05-07T19:46:53.9230644Z ---------------------------|----------------- 2025-05-07T19:46:53.9231783Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:53.9232314Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:53.9232817Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:53.9233326Z numpy-2.2.5 | py311h5d046bc_0 8.6 MB conda-forge 2025-05-07T19:46:53.9233760Z ------------------------------------------------------------ 2025-05-07T19:46:53.9234268Z Total: 8.7 MB 2025-05-07T19:46:53.9234507Z 2025-05-07T19:46:53.9234651Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:53.9234925Z 2025-05-07T19:46:53.9235175Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:53.9235771Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:53.9236360Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:53.9236932Z numpy conda-forge/linux-64::numpy-2.2.5-py311h5d046bc_0 2025-05-07T19:46:53.9237230Z 2025-05-07T19:46:53.9237234Z 2025-05-07T19:46:53.9237237Z 2025-05-07T19:46:53.9237426Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:46:53.9237853Z numpy-2.2.5 | 8.6 MB | | 0% 2025-05-07T19:46:53.9238128Z 2025-05-07T19:46:53.9238452Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:53.9238716Z 2025-05-07T19:46:53.9238720Z 2025-05-07T19:46:53.9240778Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:53.9241480Z 2025-05-07T19:46:53.9241492Z 2025-05-07T19:46:53.9241680Z 2025-05-07T19:46:54.2704382Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:54.2705104Z 2025-05-07T19:46:54.2705659Z 2025-05-07T19:46:54.2705668Z 2025-05-07T19:46:54.2706230Z liblapack-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:54.2706566Z 2025-05-07T19:46:54.2706570Z 2025-05-07T19:46:54.2706573Z 2025-05-07T19:46:54.2873100Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.2874486Z 2025-05-07T19:46:54.2874516Z 2025-05-07T19:46:54.2876207Z libcblas-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:54.2876823Z 2025-05-07T19:46:54.2876845Z 2025-05-07T19:46:54.2927695Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.2928175Z 2025-05-07T19:46:54.2928655Z 2025-05-07T19:46:54.2928665Z 2025-05-07T19:46:54.3028690Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.3029069Z 2025-05-07T19:46:54.3029074Z 2025-05-07T19:46:54.3169854Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.3272283Z numpy-2.2.5 | 8.6 MB | | 0% 2025-05-07T19:46:54.3272731Z 2025-05-07T19:46:54.3275258Z libblas-3.9.0 | 16 KB | #########7 | 97%  2025-05-07T19:46:54.3275651Z 2025-05-07T19:46:54.3550671Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.3551482Z 2025-05-07T19:46:54.4195155Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:54.4412339Z numpy-2.2.5 | 8.6 MB | #########7 | 98% 2025-05-07T19:46:54.8180800Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:46:54.8183853Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:46:54.8184321Z 2025-05-07T19:46:54.8184552Z 2025-05-07T19:46:54.8184897Z  2025-05-07T19:46:54.8185127Z 2025-05-07T19:46:54.8185131Z 
2025-05-07T19:46:54.8185353Z  2025-05-07T19:46:54.8185582Z 2025-05-07T19:46:54.8185586Z 2025-05-07T19:46:54.8185627Z 2025-05-07T19:46:54.8185820Z  done 2025-05-07T19:46:54.9195276Z Preparing transaction: | done 2025-05-07T19:46:55.1205556Z Verifying transaction: - \ done 2025-05-07T19:46:55.2217939Z Executing transaction: / done 2025-05-07T19:46:55.3310788Z ################################################################################ 2025-05-07T19:46:55.3311797Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:55.3312214Z # 2025-05-07T19:46:55.3332034Z # [2025-05-07T19:46:55.332Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:55.3332617Z ################################################################################ 2025-05-07T19:46:55.3332860Z 2025-05-07T19:46:55.3350907Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:55.4265461Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:55.4266383Z ################################################################################ 2025-05-07T19:46:55.4266876Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:55.4267230Z # 2025-05-07T19:46:55.4293958Z # [2025-05-07T19:46:55.428Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:55.4294467Z ################################################################################ 2025-05-07T19:46:55.4294739Z 2025-05-07T19:46:55.4317270Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:55.4341028Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:55.4356758Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:46:55.4358466Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:55.4361514Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:55.4369730Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:55.4394834Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.4809605Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:31.4811250Z 2025-05-07T19:48:31.4811562Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.4812010Z Collecting torch 2025-05-07T19:48:31.4812722Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp311-cp311-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:31.4813475Z Collecting filelock (from torch) 2025-05-07T19:48:31.4814044Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:31.4815050Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from torch) (4.13.2) 2025-05-07T19:48:31.4815825Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:31.4816358Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:31.4817452Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 29.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4817816Z Collecting networkx (from torch) 
2025-05-07T19:48:31.4818312Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:31.4818984Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 14.3 MB/s eta 0:00:00 2025-05-07T19:48:31.4819666Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from torch) (3.1.6) 2025-05-07T19:48:31.4820353Z Collecting fsspec (from torch) 2025-05-07T19:48:31.4820875Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:31.4821447Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4822175Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:31.4822979Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 46.9 MB/s eta 0:00:00 2025-05-07T19:48:31.4823399Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4824140Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:31.4824938Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 5.4 MB/s eta 0:00:00 2025-05-07T19:48:31.4825346Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:31.4826067Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:31.4826887Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 47.7 MB/s eta 0:00:00 2025-05-07T19:48:31.4827255Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:31.4827950Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:31.4829383Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 29.9 MB/s eta 0:00:00 
2025-05-07T19:48:31.4829788Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:31.4830644Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:31.4831576Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 40.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4831993Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:31.4833319Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:31.4834279Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 60.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4834711Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:31.4835528Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:31.4836390Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 61.7 MB/s eta 0:00:00 2025-05-07T19:48:31.4836825Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:31.4837587Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:31.4838448Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 63.6 MB/s eta 0:00:00 2025-05-07T19:48:31.4838868Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:31.4839658Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:31.4840623Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 121.8 MB/s eta 0:00:00 2025-05-07T19:48:31.4841025Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:31.4841751Z Downloading 
https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:31.4842542Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 142.0 MB/s eta 0:00:00 2025-05-07T19:48:31.4842918Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:31.4843691Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:31.4844483Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.4845175Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:31.4845850Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:31.4846653Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:31.4847527Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 125.4 MB/s eta 0:00:00 2025-05-07T19:48:31.4847920Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:31.4848734Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:31.4849551Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:31.4850416Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:31.4851826Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:31.4852707Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:31.4853282Z Downloading 
https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:31.4853922Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 52.2 MB/s eta 0:00:00 2025-05-07T19:48:31.4854683Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:31.4855769Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp311-cp311-manylinux_2_28_x86_64.whl (825.6 MB) 2025-05-07T19:48:31.4856677Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.6/825.6 MB 31.5 MB/s eta 0:00:00 2025-05-07T19:48:31.4857474Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:31.4858335Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 82.1 MB/s eta 0:00:00 2025-05-07T19:48:31.4859108Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:31.4859959Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 67.6 MB/s eta 0:00:00 2025-05-07T19:48:31.4860769Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:48:31.4861677Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 72.4 MB/s eta 0:00:00 2025-05-07T19:48:31.4863383Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:31.4864969Z 2025-05-07T19:48:31.4866906Z Successfully 
installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:31.4868944Z 2025-05-07T19:48:33.4156502Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:33.4157767Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:36.4753961Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:39.4775946Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:39.4776464Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:42.4076001Z True 2025-05-07T19:48:42.4076297Z True 2025-05-07T19:48:42.4076420Z 2025-05-07T19:48:42.4642934Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:42.4732031Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:42.4732757Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:42.4733489Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:42.4733837Z env: 2025-05-07T19:48:42.4734123Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:42.4734450Z BUILD_ENV: build_binary 2025-05-07T19:48:42.4734957Z BUILD_TARGET: default 2025-05-07T19:48:42.4735226Z BUILD_VARIANT: cuda 2025-05-07T19:48:42.4735636Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:42.4735899Z ##[endgroup] 2025-05-07T19:48:42.9205324Z /github/home/miniconda/bin/conda 2025-05-07T19:48:42.9205898Z ################################################################################ 2025-05-07T19:48:42.9206331Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:42.9206723Z # 2025-05-07T19:48:42.9224310Z # [2025-05-07T19:48:42.922Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:42.9224923Z ################################################################################ 2025-05-07T19:48:42.9225168Z 2025-05-07T19:48:42.9241990Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:43.0171791Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:43.0182534Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:43.0183436Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:43.0183867Z 2025-05-07T19:48:43.1026235Z 2025-05-07T19:48:43.1027037Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:43.1049557Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:48.3183701Z Collecting environment information... 
2025-05-07T19:48:48.3184194Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:48.3184577Z Is debug build: False 2025-05-07T19:48:48.3184881Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:48.3185200Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:48.3185420Z 2025-05-07T19:48:48.3185541Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:48.3185990Z GCC version: Could not collect 2025-05-07T19:48:48.3186624Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:48.3187428Z CMake version: version 4.0.2 2025-05-07T19:48:48.3187879Z Libc version: glibc-2.34 2025-05-07T19:48:48.3188047Z 2025-05-07T19:48:48.3188401Z Python version: 3.11.11 | packaged by conda-forge | (main, Mar 3 2025, 20:43:55) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:48.3189060Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:48.3189537Z Is CUDA available: False 2025-05-07T19:48:48.3189813Z CUDA runtime version: 12.6.85 2025-05-07T19:48:48.3190136Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:48.3190474Z GPU models and configuration: Could not collect 2025-05-07T19:48:48.3190863Z Nvidia driver version: Could not collect 2025-05-07T19:48:48.3191216Z cuDNN version: Could not collect 2025-05-07T19:48:48.3191509Z HIP runtime version: N/A 2025-05-07T19:48:48.3191806Z MIOpen runtime version: N/A 2025-05-07T19:48:48.3192089Z Is XNNPACK available: True 2025-05-07T19:48:48.3192355Z 2025-05-07T19:48:48.3192449Z CPU: 2025-05-07T19:48:48.3192735Z Architecture: x86_64 2025-05-07T19:48:48.3193091Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:48.3193534Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:48.3193948Z Byte Order: Little Endian 2025-05-07T19:48:48.3194639Z CPU(s): 96 2025-05-07T19:48:48.3194967Z On-line CPU(s) list: 0-95 2025-05-07T19:48:48.3195341Z Vendor ID: GenuineIntel 2025-05-07T19:48:48.3196174Z Model name: Intel(R) Xeon(R) 
Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:48.3196596Z CPU family: 6 2025-05-07T19:48:48.3196941Z Model: 85 2025-05-07T19:48:48.3197266Z Thread(s) per core: 2 2025-05-07T19:48:48.3197616Z Core(s) per socket: 24 2025-05-07T19:48:48.3197933Z Socket(s): 2 2025-05-07T19:48:48.3198278Z Stepping: 7 2025-05-07T19:48:48.3201648Z BogoMIPS: 5999.98 2025-05-07T19:48:48.3203936Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:48.3206214Z Hypervisor vendor: KVM 2025-05-07T19:48:48.3206571Z Virtualization type: full 2025-05-07T19:48:48.3206931Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:48.3207355Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:48.3207736Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:48.3208134Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:48.3208502Z NUMA node(s): 2 2025-05-07T19:48:48.3208816Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:48.3209188Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:48.3209661Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:48.3210247Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:48.3210744Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:48.3211359Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:48.3211959Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:48.3212560Z 
Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:48.3213190Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:48.3213561Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:48.3213963Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:48.3214337Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:48.3215119Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:48.3216178Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:48.3216849Z Vulnerability Srbds: Not affected 2025-05-07T19:48:48.3217280Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:48.3217543Z 2025-05-07T19:48:48.3217666Z Versions of relevant libraries: 2025-05-07T19:48:48.3217988Z [pip3] numpy==2.2.5 2025-05-07T19:48:48.3218263Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:48.3218622Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:48.3218969Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:48.3219335Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:48.3219708Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:48.3220025Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:48.3220368Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:48.3220697Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:48.3221194Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:48.3221539Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:48.3222024Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:48.3222331Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:48.3222683Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:48.3223028Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:48.3223360Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:48.3223875Z [conda] cuda-cudart 12.6.77 
h5888daf_0 conda-forge 2025-05-07T19:48:48.3224393Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.3224965Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.3225528Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.3226132Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.3226856Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:48.3227531Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3228063Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:48.3228807Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:48.3229613Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:48.3230187Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3230742Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:48.3231283Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3231784Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3232336Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.3232863Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:48.3233391Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:48.3233924Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:48.3234553Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3235104Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:48.3235614Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3236160Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:48.3236683Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:48.3237251Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:48.3237812Z [conda] libcusparse 12.5.4.2 
hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3238343Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:48.3238903Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:48.3239443Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:48.3240098Z [conda] numpy 2.2.5 py311h5d046bc_0 conda-forge 2025-05-07T19:48:48.3240566Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:48.3241102Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:48.3241637Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.3242152Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.3242856Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:48.3243336Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:48.3243844Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:48.3244367Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:48.3244869Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:48.3245502Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:48.3246001Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:48.3246710Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:48.3247222Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:48.3247758Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:48.3248261Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:48.3248607Z 2025-05-07T19:48:48.3907821Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:48.3908429Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:48.3908983Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:48.3909297Z env: 2025-05-07T19:48:48.3909541Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:48.3909838Z BUILD_ENV: build_binary 2025-05-07T19:48:48.3910063Z BUILD_TARGET: default 2025-05-07T19:48:48.3910293Z BUILD_VARIANT: cuda 2025-05-07T19:48:48.3910521Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:48.3910755Z ##[endgroup] 2025-05-07T19:48:48.8537832Z ################################################################################ 2025-05-07T19:48:48.8538212Z # Install cuDNN 2025-05-07T19:48:48.8538435Z # 2025-05-07T19:48:48.8553946Z # [2025-05-07T19:48:48.854Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:48.8554778Z ################################################################################ 2025-05-07T19:48:48.8555013Z 2025-05-07T19:48:48.8570543Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:48.9460254Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:48.9462096Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:48.9462796Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:48.9463046Z 2025-05-07T19:48:48.9477493Z 2025-05-07T19:48:48.9477961Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:48.9478226Z 2025-05-07T19:48:48.9504440Z 2025-05-07T19:48:48.9524578Z [INSTALL] Downloading cuDNN to /tmp/tmp.NNMU9ZsNHL ... 2025-05-07T19:48:48.9550877Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:50.5798078Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:48:50.5799225Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:50.5799392Z 2025-05-07T19:48:50.5826095Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:50.5827167Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:50.5829063Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:55.2593036Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:55.3226707Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:49:02.9511358Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:49:03.1997210Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:49:03.2383763Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:49:03.7881572Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:49:05.9360996Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:49:05.9362615Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:49:05.9363211Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:49:05.9363864Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:49:05.9364711Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:49:05.9365257Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:49:05.9365774Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:49:05.9366260Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:49:05.9366703Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:49:05.9367179Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 
2025-05-07T19:49:05.9372230Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:49:05.9374129Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:49:05.9375599Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:10.5863971Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:10.6501809Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:10.6503161Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:10.6503732Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:17.8836952Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:17.8838743Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:17.8840464Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:17.8842346Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:18.0745622Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:18.0746241Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:18.0746747Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:18.0747246Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:18.1099125Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:18.6399011Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:18.6400609Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:18.6402080Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:18.6403472Z 
cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:18.6404900Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:20.7295414Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:20.7296041Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:20.7296574Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:20.7297075Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:20.7297600Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:20.7298111Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:20.7298605Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:20.7300384Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:20.7301051Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:20.7302291Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:20.7304792Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:20.7305323Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:20.7305822Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:20.7306300Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:20.7306793Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:20.7308483Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:20.7314734Z 2025-05-07T19:49:20.7315112Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:49:20.7315598Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:20.7315862Z 2025-05-07T19:49:20.7330261Z 2025-05-07T19:49:20.7331308Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:20.7331928Z 2025-05-07T19:49:20.7344157Z 2025-05-07T19:49:20.7345116Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:20.7345523Z 2025-05-07T19:49:20.7373268Z 2025-05-07T19:49:20.7374287Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:20.7374824Z 2025-05-07T19:49:22.0572116Z 2025-05-07T19:49:22.0572438Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:22.0572769Z + rm -rf /tmp/tmp.NNMU9ZsNHL 2025-05-07T19:49:22.0573084Z 2025-05-07T19:49:22.4798981Z 2025-05-07T19:49:22.4812825Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:22.4815676Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:22.4816943Z 2025-05-07T19:49:22.8942088Z 2025-05-07T19:49:22.8942943Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:22.9012858Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:22.9013427Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:22.9014016Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:22.9014335Z env: 2025-05-07T19:49:22.9014546Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:22.9014843Z BUILD_ENV: build_binary 2025-05-07T19:49:22.9015070Z BUILD_TARGET: default 2025-05-07T19:49:22.9015297Z BUILD_VARIANT: cuda 2025-05-07T19:49:22.9015527Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:22.9015773Z ##[endgroup] 2025-05-07T19:49:23.3677576Z ################################################################################ 2025-05-07T19:49:23.3677990Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:23.3678251Z # 2025-05-07T19:49:23.3699466Z # [2025-05-07T19:49:23.369Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:23.3700056Z ################################################################################ 2025-05-07T19:49:23.3700296Z 2025-05-07T19:49:23.3716864Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:23.4623185Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:23.4639789Z [BUILD] Running git submodules update ... 
2025-05-07T19:49:23.4666952Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:23.4968008Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:23.4969468Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:23.4970817Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:23.4972070Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:23.4973297Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:23.4974064Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:23.4974483Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:23.5004769Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:23.5510193Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:23.5535022Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:25.4141725Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:25.4342300Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:25.4443965Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:25.5630156Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:25.5666081Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:25.5742948Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:25.5744310Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:25.5747181Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:25.5750831Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:25.6039678Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:25.6076956Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:25.6170380Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:25.6311407Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:25.6358335Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:25.6432765Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:25.6435904Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:25.6443684Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:25.6658227Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:25.6708130Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:25.6901017Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:25.6930715Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:25.7186200Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:25.7220787Z 
Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:25.7312634Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:25.7314270Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:25.7359100Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:25.7363562Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:25.7415429Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:25.7552204Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:25.7590747Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:25.7660262Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:25.7673881Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:25.7684760Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 
2025-05-07T19:49:25.7998823Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:25.8035087Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:25.8147774Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:25.8245399Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:25.9570933Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 215.3 MB/s eta 0:00:00 2025-05-07T19:49:25.9613161Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:25.9712465Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:25.9796464Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:25.9872477Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:25.9941136Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:26.0030289Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:26.0102860Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:26.1547054Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:26.9972050Z 2025-05-07T19:49:26.9996998Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:26.9999596Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. 
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:27.1556805Z ################################################################################ 2025-05-07T19:49:27.1557233Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:27.1557505Z # 2025-05-07T19:49:27.1577832Z # [2025-05-07T19:49:27.157Z] + install_triton_pip build_binary 2025-05-07T19:49:27.1578330Z ################################################################################ 2025-05-07T19:49:27.1578623Z 2025-05-07T19:49:27.1578855Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:27.1579318Z ################################################################################ 2025-05-07T19:49:27.1579687Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:27.1580035Z # 2025-05-07T19:49:27.1596399Z # [2025-05-07T19:49:27.159Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:27.1596995Z ################################################################################ 2025-05-07T19:49:27.1597224Z 2025-05-07T19:49:27.1616349Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:27.2443707Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:27.2444819Z ################################################################################ 2025-05-07T19:49:27.2445860Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:27.2446686Z # 2025-05-07T19:49:27.2462746Z # [2025-05-07T19:49:27.245Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:27.2464283Z ################################################################################ 2025-05-07T19:49:27.2465008Z 2025-05-07T19:49:27.2507077Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:27.2529540Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:49:27.2531638Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:27.2535850Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:27.2547139Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:27.2579593Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:33.4933967Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:33.4935074Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:33.4936374Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:33.4938377Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:49:33.4939779Z 2025-05-07T19:49:33.4939910Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:33.4940757Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:33.4942043Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:33.4943269Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 89.2 MB/s eta 0:00:00 2025-05-07T19:49:33.4943652Z Installing collected packages: pytorch-triton 2025-05-07T19:49:33.4944035Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:33.4944429Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:33.4944887Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:33.4945319Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:33.4945793Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:33.4946069Z 2025-05-07T19:49:35.4259734Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:35.4260234Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:37.2351725Z ################################################################################ 2025-05-07T19:49:37.2353027Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:37.2354459Z ################################################################################ 2025-05-07T19:49:37.2355153Z 2025-05-07T19:49:39.0044113Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:40.8007949Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:40.8009250Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:40.8094704Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:40.8095402Z . 
$PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:40.8095989Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:40.8096470Z env: 2025-05-07T19:49:40.8096686Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:40.8096993Z BUILD_ENV: build_binary 2025-05-07T19:49:40.8097227Z BUILD_TARGET: default 2025-05-07T19:49:40.8097463Z BUILD_VARIANT: cuda 2025-05-07T19:49:40.8097704Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:40.8097945Z ##[endgroup] 2025-05-07T19:49:41.2565655Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:41.2566103Z [BUILD] Extracted build target: default 2025-05-07T19:49:41.2566452Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:42.8432764Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:42.8433108Z 2025-05-07T19:49:42.9015528Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:44.4900542Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:44.4901248Z 2025-05-07T19:49:44.5724654Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:46.1603440Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:46.1603876Z 2025-05-07T19:49:46.2195805Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:47.8243175Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:47.8244095Z 2025-05-07T19:49:47.8842007Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:49.5338252Z [BUILD] Extracted and set Python tag: py311 2025-05-07T19:49:49.5338828Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:49.5591679Z core = 24 2025-05-07T19:49:49.5825926Z sockets = 2 2025-05-07T19:49:49.5826841Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:49.5827922Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:49.5829168Z [BUILD] Running pre-build cleanups ... 
2025-05-07T19:49:49.5830108Z + rm -rf dist 2025-05-07T19:49:49.5830476Z 2025-05-07T19:49:49.5845059Z 2025-05-07T19:49:49.5845742Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:49.5846088Z 2025-05-07T19:49:52.4195353Z INFO:root:running clean 2025-05-07T19:49:52.4195724Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:52.4196834Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:52.4197939Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:52.4198444Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:52.4199030Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:52.4199674Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:52.4200295Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:52.4200836Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:52.4202198Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:52.7233732Z 2025-05-07T19:49:52.7234113Z [BUILD] Printing git status ... 2025-05-07T19:49:52.7234490Z + git status 2025-05-07T19:49:52.7234627Z 2025-05-07T19:49:53.1997715Z HEAD detached at pull/4066/merge 2025-05-07T19:49:53.1998651Z Untracked files: 2025-05-07T19:49:53.1999577Z (use "git add ..." 
to include in what will be committed) 2025-05-07T19:49:53.2000677Z ../build_only/ 2025-05-07T19:49:53.2001307Z ../collect_env.py 2025-05-07T19:49:53.2002030Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:53.2002543Z 2025-05-07T19:49:53.2003793Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:53.2004647Z 2025-05-07T19:49:53.2004760Z + git diff 2025-05-07T19:49:53.2004890Z 2025-05-07T19:49:53.2283944Z 2025-05-07T19:49:53.2284717Z ################################################################################ 2025-05-07T19:49:53.2285817Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:53.2286142Z # 2025-05-07T19:49:53.2305162Z # [2025-05-07T19:49:53.229Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:53.2305636Z ################################################################################ 2025-05-07T19:49:53.2305885Z 2025-05-07T19:49:53.2309317Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:53.2310670Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:49:54.8285835Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:54.8286482Z 2025-05-07T19:49:54.8905250Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:56.4797043Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:56.4797821Z 2025-05-07T19:49:56.5402780Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:58.1307513Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:58.1308268Z 2025-05-07T19:49:58.1902021Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:59.7829655Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:59.7830689Z 2025-05-07T19:49:59.8433862Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:50:01.4979138Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:50:01.4980728Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:50:01.4981738Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:50:01.4982658Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:50:01.4983751Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:50:01.4984830Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:50:01.4985359Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:50:03.1567101Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:06.5412876Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:06.5413413Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:06.5413730Z 2025-05-07T19:50:06.9560520Z 2025-05-07T19:50:06.9561149Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:08.6123735Z [BUILD] Looking up CUDA version ... 
2025-05-07T19:50:11.9171468Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:11.9172367Z 2025-05-07T19:50:13.5915556Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:13.5916601Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:13.5917110Z 2025-05-07T19:50:13.5917322Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:13.5918282Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:13.5919207Z 2025-05-07T19:50:14.0136158Z 2025-05-07T19:50:14.0139658Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:14.0139996Z 2025-05-07T19:50:15.6126964Z -std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:15.6127890Z 2025-05-07T19:50:15.6734995Z 2025-05-07T19:50:15.6735876Z [BUILD] Setting CUDA build args ... 
2025-05-07T19:50:15.6736894Z + conda run -n build_binary c++ --version 2025-05-07T19:50:15.6737288Z 2025-05-07T19:50:17.2887003Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:17.2887994Z Target: x86_64-conda-linux-gnu 2025-05-07T19:50:17.2888329Z Thread model: posix 2025-05-07T19:50:17.2888669Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:50:17.2889349Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:17.2890124Z 2025-05-07T19:50:17.3483064Z 2025-05-07T19:50:17.3484032Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:17.3484860Z 2025-05-07T19:50:19.0204296Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:19.0206889Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:19.0208301Z 2025-05-07T19:50:19.0208867Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:50:20.6679782Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:20.6681401Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:50:20.6688603Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:50:20.6691178Z ################################################################################ 2025-05-07T19:50:20.6691543Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:20.6691833Z # 2025-05-07T19:50:20.6700782Z # [2025-05-07T19:50:20.669Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:20.6702355Z ################################################################################ 2025-05-07T19:50:20.6703036Z 2025-05-07T19:50:20.6703597Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:50:20.6710688Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py311 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:20.6715716Z 2025-05-07T19:50:22.3092583Z * Getting build dependencies for wheel... 
2025-05-07T19:50:23.5608240Z INFO:root:running egg_info 2025-05-07T19:50:23.5632314Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:23.5633596Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:23.5635885Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:23.5636788Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:23.5637613Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:23.5638948Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.5691711Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.5706053Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:23.5709949Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:23.5713062Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:23.5716386Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:23.5717473Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:23.5718083Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:23.5718692Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:23.5719301Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:23.5719753Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:50:23.5721106Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:23.8497234Z * Building wheel... 2025-05-07T19:50:25.1009571Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-radia1_8', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:25.1014154Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:50:25.1017193Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-radia1_8', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', 
'-DCMAKE_CXX_STANDARD=20', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:25.1018909Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:25.1019492Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:25.1020082Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:25.1020629Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:25.1021069Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:25.1027140Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', 
'-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:25.1033869Z 2025-05-07T19:50:25.1033874Z 2025-05-07T19:50:25.1034131Z -------------------------------------------------------------------------------- 2025-05-07T19:50:25.1034586Z -- Trying 'Ninja' generator 2025-05-07T19:50:25.1034891Z -------------------------------- 2025-05-07T19:50:25.1035210Z --------------------------- 2025-05-07T19:50:25.1035473Z ---------------------- 2025-05-07T19:50:25.1035753Z ----------------- 2025-05-07T19:50:25.1036000Z ------------ 2025-05-07T19:50:25.1036253Z ------- 2025-05-07T19:50:25.1036507Z -- 2025-05-07T19:50:25.1494573Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:25.1495251Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:25.1495793Z Not searching for unused variables given on the command line. 2025-05-07T19:50:25.1496271Z CMake. 2025-05-07T19:50:25.1496399Z 2025-05-07T19:50:25.1496632Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:50:25.1497228Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:50:25.1497768Z to work with policies introduced by <max> or earlier.
2025-05-07T19:50:25.1498036Z 2025-05-07T19:50:25.1498041Z 2025-05-07T19:50:25.2397305Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:25.2506441Z -- Detecting C compiler ABI info 2025-05-07T19:50:25.3838118Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:25.3961602Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:25.3963967Z -- Detecting C compile features 2025-05-07T19:50:25.3968080Z -- Detecting C compile features - done 2025-05-07T19:50:25.5489909Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:25.5573270Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:25.7156845Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:25.7280901Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:25.7283747Z -- Detecting CXX compile features 2025-05-07T19:50:25.7291631Z -- Detecting CXX compile features - done 2025-05-07T19:50:25.7307593Z -- Configuring done (0.6s) 2025-05-07T19:50:25.7357483Z -- Generating done (0.0s) 2025-05-07T19:50:25.7374071Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:25.7418098Z -- 2025-05-07T19:50:25.7418786Z ------- 2025-05-07T19:50:25.7419252Z ------------ 2025-05-07T19:50:25.7419534Z ----------------- 2025-05-07T19:50:25.7419777Z ---------------------- 2025-05-07T19:50:25.7420069Z --------------------------- 2025-05-07T19:50:25.7420341Z -------------------------------- 2025-05-07T19:50:25.7420673Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:25.7421423Z -------------------------------------------------------------------------------- 2025-05-07T19:50:25.7421754Z 2025-05-07T19:50:25.7429849Z Configuring Project 2025-05-07T19:50:25.7430553Z Working directory: 2025-05-07T19:50:25.7431581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:50:25.7433083Z Command: 
2025-05-07T19:50:25.7450346Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install -DPYTHON_VERSION_STRING:STRING=3.11.11 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.11.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 
-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_BUILD_TYPE:STRING=Release 2025-05-07T19:50:25.7463931Z 2025-05-07T19:50:25.7893590Z 2025-05-07T19:50:25.7894370Z Not searching for unused variables given on the command line. 
2025-05-07T19:50:25.7895355Z 2025-05-07T19:50:25.7895720Z ================================================================================ 2025-05-07T19:50:25.7896674Z Default C compiler flags 2025-05-07T19:50:25.7897725Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:25.7899011Z 2025-05-07T19:50:25.7901572Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:25.7902884Z ================================================================================ 2025-05-07T19:50:25.7903209Z 2025-05-07T19:50:25.7903214Z 2025-05-07T19:50:25.7903217Z 2025-05-07T19:50:25.7903360Z ================================================================================ 2025-05-07T19:50:25.7903700Z Default C++ compiler flags 2025-05-07T19:50:25.7904093Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:25.7904389Z 2025-05-07T19:50:25.7905206Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:25.7906264Z ================================================================================ 2025-05-07T19:50:25.7906600Z 2025-05-07T19:50:25.7906628Z 2025-05-07T19:50:25.7906631Z 2025-05-07T19:50:25.7906742Z ================================================================================ 2025-05-07T19:50:25.7907042Z AVX2_FLAGS: 2025-05-07T19:50:25.7907182Z 2025-05-07T19:50:25.7907267Z -mavx2 2025-05-07T19:50:25.7907456Z -mf16c 2025-05-07T19:50:25.7907665Z -mfma 2025-05-07T19:50:25.7907886Z -fopenmp 2025-05-07T19:50:25.7908104Z ================================================================================ 2025-05-07T19:50:25.7908320Z 2025-05-07T19:50:25.7908324Z 
2025-05-07T19:50:25.7908357Z 2025-05-07T19:50:25.7908467Z ================================================================================ 2025-05-07T19:50:25.7908766Z AVX512_FLAGS: 2025-05-07T19:50:25.7908921Z 2025-05-07T19:50:25.7909002Z -mavx2 2025-05-07T19:50:25.7909194Z -mf16c 2025-05-07T19:50:25.7909412Z -mfma 2025-05-07T19:50:25.7909629Z -mavx512f 2025-05-07T19:50:25.7909822Z -mavx512bw 2025-05-07T19:50:25.7910045Z -mavx512dq 2025-05-07T19:50:25.7910240Z -mavx512vl 2025-05-07T19:50:25.7910463Z -fopenmp 2025-05-07T19:50:25.7910679Z ================================================================================ 2025-05-07T19:50:25.7910923Z 2025-05-07T19:50:25.7910926Z 2025-05-07T19:50:25.7910930Z 2025-05-07T19:50:25.7911040Z ================================================================================ 2025-05-07T19:50:25.7911372Z The project is built using scikit-build 2025-05-07T19:50:25.7911714Z ================================================================================ 2025-05-07T19:50:25.7911933Z 2025-05-07T19:50:25.7911936Z 2025-05-07T19:50:25.7911939Z 2025-05-07T19:50:25.7912072Z ================================================================================ 2025-05-07T19:50:25.7912377Z Build Settings 2025-05-07T19:50:25.7912529Z 2025-05-07T19:50:25.7912634Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:25.7912917Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:25.7913121Z 2025-05-07T19:50:25.7913217Z NVCC_VERBOSE : 2025-05-07T19:50:25.7913487Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:25.7913734Z CUDNN_LIBRARY : 2025-05-07T19:50:25.7914273Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:25.7914935Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:25.7915242Z 8.0 2025-05-07T19:50:25.7915439Z 9.0 2025-05-07T19:50:25.7915668Z 9.0a 2025-05-07T19:50:25.7915788Z 2025-05-07T19:50:25.7915897Z HIP_ROOT_DIR : 2025-05-07T19:50:25.7916193Z HIPCC_VERBOSE : 2025-05-07T19:50:25.7916460Z AMDGPU_TARGETS : 
2025-05-07T19:50:25.7916764Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:25.7917092Z ================================================================================ 2025-05-07T19:50:25.7917335Z 2025-05-07T19:50:25.9377128Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:26.0064319Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:27.0676199Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler Clang 16.0.6 2025-05-07T19:50:27.0784369Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:27.2217370Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:27.2348597Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:27.2349651Z -- Detecting CXX compile features 2025-05-07T19:50:27.2356512Z -- Detecting CXX compile features - done 2025-05-07T19:50:27.2430648Z -- Detecting C compiler ABI info 2025-05-07T19:50:27.3739347Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:27.3866570Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:27.3867184Z -- Detecting C compile features 2025-05-07T19:50:27.3871465Z -- Detecting C compile features - done 2025-05-07T19:50:27.3921327Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:28.4391684Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:28.4920851Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:28.4955906Z -- Detecting CUDA compile features 2025-05-07T19:50:28.4958649Z -- Detecting CUDA compile features - done 2025-05-07T19:50:28.4983361Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:28.7960027Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:28.7961296Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:29.1370951Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:29.1371443Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:29.4309659Z -- Performing Test 
C_HAS_AVX2_1 - Failed 2025-05-07T19:50:29.4310107Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:29.7733987Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:29.7734487Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:30.0689895Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:30.0690416Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:30.4128776Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:30.4129384Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:30.7063438Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:30.7064437Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:31.0478826Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:31.0480507Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:31.3446568Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:31.3447606Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:31.6881980Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:31.6883769Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:31.9829882Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:31.9830933Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:32.3268574Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:32.3443044Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:32.3477735Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:32.3541733Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:32.4894021Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:50:32.4903402Z -- Found Threads: TRUE 2025-05-07T19:50:32.4916154Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/FindCUDAToolkit.cmake:957 (message): 2025-05-07T19:50:32.4917687Z Could not find librt library, needed by CUDA::cudart_static 
2025-05-07T19:50:32.4918102Z Call Stack (most recent call first): 2025-05-07T19:50:32.4918826Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:59 (find_package) 2025-05-07T19:50:32.4919963Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:32.4921374Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:32.4922249Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.4922721Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.4923063Z 2025-05-07T19:50:32.4923068Z 2025-05-07T19:50:32.6186775Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:32.6188309Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:32.6190510Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:32.8022798Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:32.8836772Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.11.11") found components: Interpreter 2025-05-07T19:50:32.8846405Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:32.8848784Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:32.8849758Z Call Stack (most recent call first): 2025-05-07T19:50:32.8851798Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:32.8855068Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:32.8856305Z 
/__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.8856717Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.8856906Z 2025-05-07T19:50:32.8856911Z 2025-05-07T19:50:32.8857062Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:32.8857506Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:32.8857927Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:32.8858345Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:32.8859147Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:32.9192736Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:32.9195382Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:50:32.9195769Z Call Stack (most recent call first): 2025-05-07T19:50:32.9196549Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:32.9197488Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:32.9197957Z CMakeLists.txt:112 (include) 2025-05-07T19:50:32.9198142Z 2025-05-07T19:50:32.9198147Z 2025-05-07T19:50:32.9198539Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:32.9199066Z 2025-05-07T19:50:32.9199070Z 2025-05-07T19:50:32.9199190Z ================================================================================ 2025-05-07T19:50:32.9199526Z PyTorch Flags: 2025-05-07T19:50:32.9199756Z 2025-05-07T19:50:32.9199976Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:32.9200399Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:32.9201211Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.9201791Z 2025-05-07T19:50:32.9202011Z TORCH_LIBRARIES: 2025-05-07T19:50:32.9202236Z torch 2025-05-07T19:50:32.9202453Z torch_library 2025-05-07T19:50:32.9202910Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.9203586Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.9204529Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.9205058Z 2025-05-07T19:50:32.9205281Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:32.9205533Z --expt-relaxed-constexpr 2025-05-07T19:50:32.9205821Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:32.9206186Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:32.9206616Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:32.9206903Z 
================================================================================ 2025-05-07T19:50:32.9207114Z 2025-05-07T19:50:32.9207146Z 2025-05-07T19:50:32.9207150Z 2025-05-07T19:50:32.9207260Z ================================================================================ 2025-05-07T19:50:32.9207562Z NCCL Flags 2025-05-07T19:50:32.9207670Z 2025-05-07T19:50:32.9208037Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.9208869Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.9209470Z ================================================================================ 2025-05-07T19:50:32.9209678Z 2025-05-07T19:50:32.9209682Z 2025-05-07T19:50:32.9209685Z 2025-05-07T19:50:32.9209789Z ================================================================================ 2025-05-07T19:50:32.9210097Z CUDA Driver Path 2025-05-07T19:50:32.9210226Z 2025-05-07T19:50:32.9210574Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:32.9211111Z ================================================================================ 2025-05-07T19:50:32.9211320Z 2025-05-07T19:50:32.9211604Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.9230584Z 2025-05-07T19:50:32.9230903Z 2025-05-07T19:50:32.9231466Z ================================================================================ 2025-05-07T19:50:32.9232048Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:32.9232429Z 2025-05-07T19:50:32.9232688Z CPU_SRCS: 2025-05-07T19:50:32.9232850Z 2025-05-07T19:50:32.9232962Z 2025-05-07T19:50:32.9233200Z GPU_SRCS: 2025-05-07T19:50:32.9233329Z 2025-05-07T19:50:32.9233422Z 2025-05-07T19:50:32.9233747Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:32.9233902Z 2025-05-07T19:50:32.9234088Z 2025-05-07T19:50:32.9234321Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:50:32.9234478Z 2025-05-07T19:50:32.9234636Z 2025-05-07T19:50:32.9234854Z OTHER_SRCS: 2025-05-07T19:50:32.9235293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:32.9235943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:32.9236615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:32.9237290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:32.9237942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:32.9238584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:32.9239183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:32.9239818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:32.9240458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:32.9241069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:32.9241716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:32.9242388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:32.9243094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:32.9243759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:32.9244718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:32.9245447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:32.9246127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:32.9246973Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:32.9247626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:32.9248319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:32.9248925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:32.9249572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:32.9250234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:32.9250869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:32.9251522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:32.9252117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:32.9252759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:32.9253415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:32.9254000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:32.9254616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:32.9255321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:32.9255955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:32.9256592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:32.9257221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:32.9257820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:32.9258449Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:32.9259048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:32.9259679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:32.9260265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:32.9260884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:32.9261501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:32.9262080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:32.9262700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:32.9263291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:32.9263914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:32.9264523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:32.9265172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:32.9265811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:32.9266432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:32.9267095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:32.9267718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:32.9268361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:32.9269104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:32.9269741Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:32.9270386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:32.9271064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:32.9271702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:32.9272313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:32.9272958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:32.9273447Z 2025-05-07T19:50:32.9273666Z CC_FLAGS: 2025-05-07T19:50:32.9273804Z 2025-05-07T19:50:32.9273939Z 2025-05-07T19:50:32.9274225Z NVCC_FLAGS: 2025-05-07T19:50:32.9274395Z 2025-05-07T19:50:32.9274494Z 2025-05-07T19:50:32.9274715Z HIPCC_FLAGS: 2025-05-07T19:50:32.9274894Z 2025-05-07T19:50:32.9274991Z 2025-05-07T19:50:32.9275215Z INCLUDE_DIRS: 2025-05-07T19:50:32.9275562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9275910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:32.9276259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:32.9276633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9277151Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:32.9277988Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.9278654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:32.9279114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:32.9279563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:32.9280085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:32.9280652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:32.9281134Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:32.9281739Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.9282270Z 2025-05-07T19:50:32.9282518Z Selected Source Files: 2025-05-07T19:50:32.9282686Z 2025-05-07T19:50:32.9282778Z 2025-05-07T19:50:32.9283026Z HIPified Source Files: 2025-05-07T19:50:32.9283192Z 2025-05-07T19:50:32.9283318Z 2025-05-07T19:50:32.9283545Z Library Dependencies: 2025-05-07T19:50:32.9283839Z torch 2025-05-07T19:50:32.9284055Z torch_library 2025-05-07T19:50:32.9284541Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.9285242Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.9285995Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.9286918Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.9287685Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:32.9288328Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.9288748Z 2025-05-07T19:50:32.9288990Z Output Library: 2025-05-07T19:50:32.9289220Z asmjit 2025-05-07T19:50:32.9289457Z 2025-05-07T19:50:32.9289678Z Destination Directory: 2025-05-07T19:50:32.9289964Z fbgemm_gpu 2025-05-07T19:50:32.9290216Z ================================================================================ 2025-05-07T19:50:32.9290478Z 2025-05-07T19:50:32.9290482Z 2025-05-07T19:50:32.9290488Z 2025-05-07T19:50:32.9290611Z ================================================================================ 2025-05-07T19:50:32.9290991Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:32.9291302Z 2025-05-07T19:50:32.9291531Z CPU_SRCS: 2025-05-07T19:50:32.9291655Z 2025-05-07T19:50:32.9291748Z 2025-05-07T19:50:32.9292069Z 
GPU_SRCS: 2025-05-07T19:50:32.9292196Z 2025-05-07T19:50:32.9292291Z 2025-05-07T19:50:32.9292542Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:32.9292694Z 2025-05-07T19:50:32.9292785Z 2025-05-07T19:50:32.9293031Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:32.9293180Z 2025-05-07T19:50:32.9293372Z 2025-05-07T19:50:32.9293574Z OTHER_SRCS: 2025-05-07T19:50:32.9293901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:32.9294363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:32.9294879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:32.9295307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:32.9295762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:32.9296256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:32.9296753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:32.9297165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:32.9297569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:32.9298032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:32.9298466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:32.9298925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:32.9299365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:32.9299772Z 2025-05-07T19:50:32.9299978Z CC_FLAGS: 2025-05-07T19:50:32.9300134Z 2025-05-07T19:50:32.9300223Z 2025-05-07T19:50:32.9300457Z NVCC_FLAGS: 2025-05-07T19:50:32.9300585Z 2025-05-07T19:50:32.9300674Z 2025-05-07T19:50:32.9300915Z HIPCC_FLAGS: 2025-05-07T19:50:32.9301048Z 2025-05-07T19:50:32.9301139Z 2025-05-07T19:50:32.9301374Z INCLUDE_DIRS: 2025-05-07T19:50:32.9301633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9301993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:32.9302296Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:32.9302650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:32.9303185Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:32.9303973Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:32.9304656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:32.9305082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:32.9305557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:32.9306038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:32.9306590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:32.9307112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:32.9307683Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:32.9308227Z 2025-05-07T19:50:32.9308452Z Selected Source Files: 2025-05-07T19:50:32.9308646Z 2025-05-07T19:50:32.9308736Z 2025-05-07T19:50:32.9308956Z HIPified Source Files: 2025-05-07T19:50:32.9309153Z 2025-05-07T19:50:32.9309241Z 2025-05-07T19:50:32.9309459Z Library Dependencies: 2025-05-07T19:50:32.9309744Z torch 2025-05-07T19:50:32.9309965Z torch_library 2025-05-07T19:50:32.9310445Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:32.9311169Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:32.9311875Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:32.9312707Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:32.9313458Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:32.9313973Z asmjit 2025-05-07T19:50:32.9314641Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:32.9315111Z 2025-05-07T19:50:32.9315428Z Output Library: 2025-05-07T19:50:32.9315666Z fbgemm 2025-05-07T19:50:32.9315923Z 2025-05-07T19:50:32.9316157Z Destination Directory: 2025-05-07T19:50:32.9316461Z fbgemm_gpu 2025-05-07T19:50:32.9316811Z ================================================================================ 2025-05-07T19:50:32.9317092Z 2025-05-07T19:50:32.9317096Z 2025-05-07T19:50:32.9317101Z 2025-05-07T19:50:32.9317232Z ================================================================================ 2025-05-07T19:50:32.9317623Z Running code generation script ... 2025-05-07T19:50:32.9318389Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:32.9319213Z ================================================================================ 2025-05-07T19:50:32.9319454Z 2025-05-07T19:50:33.4654544Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:33.4655486Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:33.4656243Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.4656790Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:33.4657391Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4657928Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.4658474Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:33.4658981Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.4659508Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:33.4660062Z 
Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4660613Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.4661158Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:33.4661691Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4662257Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4662808Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4663416Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4664005Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4664557Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4665133Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4665694Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4666415Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4667066Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4667577Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:33.4668023Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:33.4668398Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:33.4668839Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:33.4669319Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4669835Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:33.4670299Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:33.4670814Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:50:33.4671341Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:33.4671830Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4672598Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4673142Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4673687Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4674609Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4675233Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4675797Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.4676256Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:33.4676706Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.4677173Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4677630Z Written: lookup_adagrad.py 2025-05-07T19:50:33.4677965Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:33.4678418Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:33.4678882Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4679408Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.4679915Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:33.4680416Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4680960Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.4681454Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:33.4681973Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.4682459Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:33.4682995Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4683560Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.4684080Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:33.4684639Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4685152Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4685680Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4686227Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.4686864Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4687383Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4687876Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4688402Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4688943Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.4689476Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4690128Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:33.4690558Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:33.4690945Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:33.4691378Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4691783Z Written: lookup_adam.py 2025-05-07T19:50:33.4692070Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:33.4692508Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4692990Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:50:33.4693526Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4694023Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:33.4694485Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:33.4695178Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4695660Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:33.4696145Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4696653Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4697266Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4697763Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4698270Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4698808Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4699281Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:33.4699695Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:33.4700081Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:33.4700567Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4701002Z Written: lookup_lamb.py 2025-05-07T19:50:33.4701314Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:33.4701890Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4702358Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:33.4702870Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4703365Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:33.4703863Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:33.4704359Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4704891Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:33.4705410Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4705949Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4706519Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4810199Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4811129Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4811694Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.4812364Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.4812790Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:33.4813175Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.4813614Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.4814017Z Written: lookup_lars_sgd.py 2025-05-07T19:50:33.4814342Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:33.4814794Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.4815317Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:33.4815923Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.4816527Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:33.4817171Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:33.4817836Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.4818473Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:33.4819132Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.4819799Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.4820449Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.4821241Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.4821952Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.4822666Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5549171Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:33.5549780Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:33.5550267Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:33.5550844Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5551311Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:33.5551739Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:33.5552310Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5552973Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:33.5553599Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5554339Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:33.5554958Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:50:33.5555585Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5556219Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:33.5556879Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.5557572Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5558234Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5558911Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.5559588Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5560291Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5560945Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:33.5561494Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:33.5562025Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:33.5562592Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5563111Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:33.5563538Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:33.5564128Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5564753Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:33.5565328Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.5565914Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:33.5566457Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:50:33.5567155Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5567713Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.5568300Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.5568871Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.5569403Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:33.5569949Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:33.5570768Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:33.5571336Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.5571863Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:33.5572511Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:33.5573079Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5573646Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.5574240Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.5574787Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.5575354Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:33.5575880Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:33.5576465Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5577064Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5577621Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:50:33.5578194Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.5578760Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5579387Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.5580014Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5580600Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.5581203Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5581770Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.5582369Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5582952Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5583548Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:33.5584136Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.5584719Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5585362Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.5585968Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5586588Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.5587199Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5587773Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.5588392Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:50:33.5588989Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:33.5589630Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:33.5590241Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:33.5590885Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:33.5591520Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:33.5592128Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:33.5592830Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:33.5593412Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:33.5593988Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:33.5594802Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:33.5595513Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:33.5596108Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:33.5596672Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.5597211Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:33.5597662Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5598211Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5598681Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:33.5599105Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:33.5599607Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5600140Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5600635Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:33.5601033Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:33.5601539Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:33.5602062Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5602697Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.5603299Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:33.5603814Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.5604415Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5605005Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:33.5605612Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5606275Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:33.5607051Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:33.5607637Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:33.5608257Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5608902Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:33.5609517Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.5610225Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:33.5610899Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:33.5611499Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:33.5612200Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.5612868Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:33.6600446Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6601214Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.6601972Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:33.6602654Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6603567Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.6604278Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:33.6604951Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.6605643Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:33.6606471Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6607217Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.6607932Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:33.6608630Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6609369Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6610097Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:33.6610870Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6611622Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6612352Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6613093Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6613930Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6614772Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6615472Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6616114Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:33.6616705Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:33.6617234Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:33.6617845Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6618349Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:33.6618821Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:33.6619429Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6620072Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:33.6620712Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:33.6621289Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:33.6621949Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6622581Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:33.6623232Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6623894Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.6624435Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:33.6624956Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:33.6625511Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6626092Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:33.6626664Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6627246Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:33.6627712Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:33.6628159Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6629026Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:33.6629614Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:33.6630123Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:33.6630639Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:33.6631141Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6631688Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:33.6632184Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:33.6632720Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:50:33.6633237Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6633790Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.6634443Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:33.6634973Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6635536Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6636053Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6636631Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6637223Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:33.6637767Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6638294Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.6638723Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:33.6639134Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.6639603Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6639996Z Written: lookup_sgd.py 2025-05-07T19:50:33.6640341Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:33.6640740Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:33.6641213Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6641727Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:33.6642238Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:33.6642672Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:33.6643198Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6643724Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:50:33.6644208Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6644749Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:33.6645248Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:33.6645785Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:33.6646276Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:33.6646906Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:33.6647419Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:33.6647894Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:33.6648436Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:33.6648946Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:33.6649462Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:33.6649964Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:33.6650604Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:33.6651123Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:33.6651512Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:33.6651895Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:33.6652377Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:33.6652780Z Written: lookup_none.py 2025-05-07T19:50:33.6653075Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:33.6653513Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:33.6654021Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:33.6654527Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:33.6655072Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:33.6655563Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6656065Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6656530Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:33.6657003Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:33.6657506Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:33.6658014Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:33.6658534Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6659020Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:33.6659515Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:33.6659976Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:33.6660454Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:33.6660911Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:33.6661371Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6661859Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6662339Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:33.6662758Z Written: pt2_arg_utils.h 2025-05-07T19:50:33.6662999Z Written: __init__.py 2025-05-07T19:50:33.6663259Z Written: lookup_args_ssd.py 2025-05-07T19:50:33.6663532Z Written: lookup_args.py 2025-05-07T19:50:33.6725597Z 2025-05-07T19:50:33.6725652Z 2025-05-07T19:50:33.6725899Z ================================================================================ 
2025-05-07T19:50:33.6726445Z Running code generation script ... 2025-05-07T19:50:33.6727451Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:33.6728781Z ================================================================================ 2025-05-07T19:50:33.6729141Z 2025-05-07T19:50:33.7746937Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:33.7747818Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:33.7748616Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:33.7749111Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:33.7749627Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:33.7750165Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:33.7750662Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:33.7751055Z Written: optimizer_args.py 2025-05-07T19:50:33.7835136Z 2025-05-07T19:50:33.7835493Z 2025-05-07T19:50:33.7836463Z ================================================================================ 2025-05-07T19:50:33.7836942Z Running code generation script ... 
2025-05-07T19:50:33.7837754Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:33.7838674Z ================================================================================ 2025-05-07T19:50:33.7839062Z 2025-05-07T19:50:33.8985157Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:33.8986248Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:33.8987099Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8987845Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8988484Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8989116Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:33.8989749Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:33.8990383Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:33.8991038Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8991731Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8992403Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8993089Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:33.8993758Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:33.8994781Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:33.8995516Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:33.8996208Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:33.8996920Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:33.8997596Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:33.8998299Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:33.8999019Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:33.8999677Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:33.9000361Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:33.9001126Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:33.9001693Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:33.9002205Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:33.9075558Z 2025-05-07T19:50:33.9076225Z 2025-05-07T19:50:33.9076491Z ================================================================================ 2025-05-07T19:50:33.9076901Z Running code generation script ... 
2025-05-07T19:50:33.9077698Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:33.9078620Z ================================================================================ 2025-05-07T19:50:33.9078846Z 2025-05-07T19:50:34.2444280Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:34.2447279Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:34.2448014Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2448541Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2450207Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2450725Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2451192Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2451692Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2452179Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2452619Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.2453109Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2453591Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.2454086Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2454529Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.2455023Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2455527Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2456000Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:50:34.2456511Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.2456977Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2457449Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2457908Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2458389Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2458851Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2459300Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2459764Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2460193Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.2460661Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2461133Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.2461610Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2462069Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.2462502Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2462931Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.2463344Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2463806Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2464219Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.2464635Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2465059Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.2465456Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.2465848Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.2466250Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2466708Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.2467133Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2467569Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.2467979Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.2468386Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.2468924Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.2469361Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.2469817Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.2470259Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.2470751Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.2471159Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.2471629Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2472135Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2472611Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2473110Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.2473562Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.2473984Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.2474691Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.2557502Z 2025-05-07T19:50:34.2557986Z 2025-05-07T19:50:34.2558574Z ================================================================================ 2025-05-07T19:50:34.2559683Z Running code 
generation script ... 2025-05-07T19:50:34.2561857Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:34.2562904Z ================================================================================ 2025-05-07T19:50:34.2563135Z 2025-05-07T19:50:34.5154419Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:34.5155511Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:34.5156242Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.5156671Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.5157111Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.5157576Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.5158029Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.5158602Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.5159067Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:34.5159549Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.5159968Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:34.5255136Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:34.5270698Z 2025-05-07T19:50:34.5270797Z 2025-05-07T19:50:34.5271314Z ================================================================================ 2025-05-07T19:50:34.5272533Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:34.5273512Z 2025-05-07T19:50:34.5274331Z CPU_SRCS: 2025-05-07T19:50:34.5274905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:34.5275588Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:34.5276250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:34.5276864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:34.5277487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:34.5277958Z 2025-05-07T19:50:34.5278159Z GPU_SRCS: 2025-05-07T19:50:34.5278605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:34.5279156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:34.5279986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:34.5280596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:34.5281189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:34.5281756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:34.5282409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:34.5282976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:34.5283515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:34.5284147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:34.5284591Z 2025-05-07T19:50:34.5284810Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.5284947Z 2025-05-07T19:50:34.5285050Z 2025-05-07T19:50:34.5285236Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.5285374Z 2025-05-07T19:50:34.5285478Z 2025-05-07T19:50:34.5285662Z OTHER_SRCS: 2025-05-07T19:50:34.5285783Z 2025-05-07T19:50:34.5285885Z 2025-05-07T19:50:34.5286063Z CC_FLAGS: 2025-05-07T19:50:34.5286193Z 
2025-05-07T19:50:34.5286271Z 2025-05-07T19:50:34.5286452Z NVCC_FLAGS: 2025-05-07T19:50:34.5286689Z --expt-relaxed-constexpr 2025-05-07T19:50:34.5286953Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.5287249Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.5287553Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.5287800Z 2025-05-07T19:50:34.5288011Z HIPCC_FLAGS: 2025-05-07T19:50:34.5288131Z 2025-05-07T19:50:34.5288214Z 2025-05-07T19:50:34.5288416Z INCLUDE_DIRS: 2025-05-07T19:50:34.5288636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5288946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.5289207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.5289513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5289976Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.5290719Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.5291335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.5291714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.5292131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.5292568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.5293066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.5293493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.5294027Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.5294521Z 2025-05-07T19:50:34.5294711Z Selected Source Files: 2025-05-07T19:50:34.5295133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:34.5295742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:34.5296360Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:34.5296906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:34.5297489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:34.5298083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:34.5298625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:34.5299221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:34.5299812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:34.5300394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:34.5301012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:34.5301618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:34.5302190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:34.5302783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:34.5303397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:34.5303834Z 2025-05-07T19:50:34.5304040Z HIPified Source Files: 2025-05-07T19:50:34.5304185Z 2025-05-07T19:50:34.5304261Z 2025-05-07T19:50:34.5304467Z Library Dependencies: 2025-05-07T19:50:34.5304703Z torch 2025-05-07T19:50:34.5304885Z torch_library 2025-05-07T19:50:34.5305308Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.5305939Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.5306601Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.5307337Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.5308038Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.5308617Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.5308988Z 2025-05-07T19:50:34.5309190Z Output Library: 2025-05-07T19:50:34.5309405Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.5309633Z 2025-05-07T19:50:34.5309824Z Destination Directory: 2025-05-07T19:50:34.5310067Z fbgemm_gpu 2025-05-07T19:50:34.5310280Z ================================================================================ 2025-05-07T19:50:34.5310511Z 2025-05-07T19:50:34.5753160Z 2025-05-07T19:50:34.5753617Z 2025-05-07T19:50:34.5754527Z ================================================================================ 2025-05-07T19:50:34.5755196Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:34.5755553Z 2025-05-07T19:50:34.5755768Z CPU_SRCS: 2025-05-07T19:50:34.5756136Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:34.5756595Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:34.5757063Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:34.5757405Z 2025-05-07T19:50:34.5757614Z GPU_SRCS: 2025-05-07T19:50:34.5757897Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:34.5758382Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:34.5759049Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5759621Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5760206Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5760765Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5761344Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5761898Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5762502Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5763133Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5763739Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5764360Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:34.5764967Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:34.5765591Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:34.5766401Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5767000Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5767604Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5768278Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5768878Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5769455Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5770030Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:34.5770605Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:34.5771157Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 
2025-05-07T19:50:34.5771577Z 2025-05-07T19:50:34.5771769Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.5771936Z 2025-05-07T19:50:34.5772016Z 2025-05-07T19:50:34.5772212Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.5772373Z 2025-05-07T19:50:34.5772456Z 2025-05-07T19:50:34.5772649Z OTHER_SRCS: 2025-05-07T19:50:34.5772788Z 2025-05-07T19:50:34.5772873Z 2025-05-07T19:50:34.5773079Z CC_FLAGS: 2025-05-07T19:50:34.5773200Z 2025-05-07T19:50:34.5773282Z 2025-05-07T19:50:34.5773482Z NVCC_FLAGS: 2025-05-07T19:50:34.5773704Z --expt-relaxed-constexpr 2025-05-07T19:50:34.5773992Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.5774269Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.5774581Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.5774832Z 2025-05-07T19:50:34.5775040Z HIPCC_FLAGS: 2025-05-07T19:50:34.5775162Z 2025-05-07T19:50:34.5775239Z 2025-05-07T19:50:34.5775442Z INCLUDE_DIRS: 2025-05-07T19:50:34.5775689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5775985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.5776264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.5776555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.5777030Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.5777761Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.5778375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.5778757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.5779171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.5779625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.5780106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.5780545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.5781067Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.5781562Z 2025-05-07T19:50:34.5781758Z Selected Source Files: 2025-05-07T19:50:34.5782088Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:34.5782532Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:34.5782944Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:34.5783364Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:34.5783795Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:34.5784320Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5784882Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5785459Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5786042Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5786604Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5787288Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5787883Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5788530Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5789207Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5789836Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:34.5790464Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:34.5791071Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:34.5791679Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:34.5792253Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:34.5792850Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:34.5793435Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:34.5794109Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:34.5794922Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:34.5795513Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:34.5796123Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:34.5796715Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.5797160Z 2025-05-07T19:50:34.5797387Z HIPified Source Files: 2025-05-07T19:50:34.5797549Z 2025-05-07T19:50:34.5797633Z 2025-05-07T19:50:34.5797855Z Library Dependencies: 2025-05-07T19:50:34.5798088Z torch 2025-05-07T19:50:34.5798303Z torch_library 2025-05-07T19:50:34.5798745Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.5799438Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.5800132Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.5801033Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.5801735Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.5802165Z asmjit 2025-05-07T19:50:34.5802362Z fbgemm 2025-05-07T19:50:34.5802546Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.5802787Z fbgemm_gpu_config 
2025-05-07T19:50:34.5803110Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.5803503Z 2025-05-07T19:50:34.5803683Z Output Library: 2025-05-07T19:50:34.5803917Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:34.5804139Z 2025-05-07T19:50:34.5804358Z Destination Directory: 2025-05-07T19:50:34.5804601Z fbgemm_gpu 2025-05-07T19:50:34.5804818Z ================================================================================ 2025-05-07T19:50:34.5805038Z 2025-05-07T19:50:34.7997192Z 2025-05-07T19:50:34.7997272Z 2025-05-07T19:50:34.7997766Z ================================================================================ 2025-05-07T19:50:34.7999005Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:34.7999951Z 2025-05-07T19:50:34.8000490Z CPU_SRCS: 2025-05-07T19:50:34.8001107Z src/config/feature_gates.cpp 2025-05-07T19:50:34.8001812Z 2025-05-07T19:50:34.8002322Z GPU_SRCS: 2025-05-07T19:50:34.8002629Z 2025-05-07T19:50:34.8002834Z 2025-05-07T19:50:34.8003382Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8003781Z 2025-05-07T19:50:34.8003987Z 2025-05-07T19:50:34.8004422Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8004558Z 2025-05-07T19:50:34.8004640Z 2025-05-07T19:50:34.8004845Z OTHER_SRCS: 2025-05-07T19:50:34.8004959Z 2025-05-07T19:50:34.8005248Z 2025-05-07T19:50:34.8005464Z CC_FLAGS: 2025-05-07T19:50:34.8005578Z 2025-05-07T19:50:34.8005678Z 2025-05-07T19:50:34.8005861Z NVCC_FLAGS: 2025-05-07T19:50:34.8005974Z 2025-05-07T19:50:34.8006078Z 2025-05-07T19:50:34.8006434Z HIPCC_FLAGS: 2025-05-07T19:50:34.8006656Z 2025-05-07T19:50:34.8006759Z 2025-05-07T19:50:34.8006952Z INCLUDE_DIRS: 2025-05-07T19:50:34.8007207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8007695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8008012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8008333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8008857Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8009676Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8010317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8010756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8011182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8011673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8012189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8012663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8013241Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8013742Z 2025-05-07T19:50:34.8013953Z Selected Source Files: 2025-05-07T19:50:34.8014207Z src/config/feature_gates.cpp 2025-05-07T19:50:34.8014479Z 2025-05-07T19:50:34.8014676Z HIPified Source Files: 2025-05-07T19:50:34.8014851Z 2025-05-07T19:50:34.8014936Z 2025-05-07T19:50:34.8015137Z Library Dependencies: 2025-05-07T19:50:34.8015390Z torch 2025-05-07T19:50:34.8015702Z torch_library 2025-05-07T19:50:34.8016146Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8016817Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8017708Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8018528Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8019287Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8019903Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8020331Z 
2025-05-07T19:50:34.8020530Z Output Library: 2025-05-07T19:50:34.8020771Z fbgemm_gpu_config 2025-05-07T19:50:34.8020987Z 2025-05-07T19:50:34.8021208Z Destination Directory: 2025-05-07T19:50:34.8021453Z fbgemm_gpu 2025-05-07T19:50:34.8021706Z ================================================================================ 2025-05-07T19:50:34.8021940Z 2025-05-07T19:50:34.8022025Z 2025-05-07T19:50:34.8022033Z 2025-05-07T19:50:34.8022154Z ================================================================================ 2025-05-07T19:50:34.8022557Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:34.8022894Z 2025-05-07T19:50:34.8023112Z CPU_SRCS: 2025-05-07T19:50:34.8023412Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:34.8023883Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:34.8024244Z 2025-05-07T19:50:34.8024456Z GPU_SRCS: 2025-05-07T19:50:34.8024723Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:34.8025151Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:34.8025540Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:34.8025931Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:34.8026343Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:34.8026690Z 2025-05-07T19:50:34.8026907Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8027130Z 2025-05-07T19:50:34.8027215Z 2025-05-07T19:50:34.8027439Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8027576Z 2025-05-07T19:50:34.8027658Z 2025-05-07T19:50:34.8027868Z OTHER_SRCS: 2025-05-07T19:50:34.8027988Z 2025-05-07T19:50:34.8028068Z 2025-05-07T19:50:34.8028273Z CC_FLAGS: 2025-05-07T19:50:34.8028661Z 2025-05-07T19:50:34.8028765Z 2025-05-07T19:50:34.8028953Z NVCC_FLAGS: 2025-05-07T19:50:34.8029072Z 2025-05-07T19:50:34.8029173Z 2025-05-07T19:50:34.8029358Z HIPCC_FLAGS: 2025-05-07T19:50:34.8029482Z 2025-05-07T19:50:34.8029583Z 2025-05-07T19:50:34.8029766Z INCLUDE_DIRS: 
2025-05-07T19:50:34.8030024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8030338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8030646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8030955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8031470Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8032295Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8032946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8033384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8033810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8034394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8034908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8035389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8035965Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8036469Z 2025-05-07T19:50:34.8036697Z Selected Source Files: 2025-05-07T19:50:34.8037022Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:34.8037491Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:34.8037931Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:34.8038351Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:34.8038760Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:34.8039136Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:34.8039554Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:34.8039903Z 2025-05-07T19:50:34.8040136Z HIPified Source Files: 2025-05-07T19:50:34.8040288Z 2025-05-07T19:50:34.8040369Z 2025-05-07T19:50:34.8040707Z 
Library Dependencies: 2025-05-07T19:50:34.8040945Z torch 2025-05-07T19:50:34.8041173Z torch_library 2025-05-07T19:50:34.8041717Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8042563Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8043281Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8044071Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8044814Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8045401Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8045822Z 2025-05-07T19:50:34.8046017Z Output Library: 2025-05-07T19:50:34.8046273Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8046527Z 2025-05-07T19:50:34.8046737Z Destination Directory: 2025-05-07T19:50:34.8047006Z fbgemm_gpu 2025-05-07T19:50:34.8047244Z ================================================================================ 2025-05-07T19:50:34.8047472Z 2025-05-07T19:50:34.8047476Z 2025-05-07T19:50:34.8047499Z 2025-05-07T19:50:34.8047617Z ================================================================================ 2025-05-07T19:50:34.8048021Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:34.8048404Z 2025-05-07T19:50:34.8048719Z CPU_SRCS: 2025-05-07T19:50:34.8048964Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:34.8049261Z 2025-05-07T19:50:34.8049444Z GPU_SRCS: 2025-05-07T19:50:34.8049677Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:34.8049946Z 2025-05-07T19:50:34.8050156Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8050394Z 2025-05-07T19:50:34.8050472Z 2025-05-07T19:50:34.8050684Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8050816Z 2025-05-07T19:50:34.8050894Z 2025-05-07T19:50:34.8051102Z OTHER_SRCS: 
2025-05-07T19:50:34.8051219Z 2025-05-07T19:50:34.8051302Z 2025-05-07T19:50:34.8051502Z CC_FLAGS: 2025-05-07T19:50:34.8051621Z 2025-05-07T19:50:34.8051716Z 2025-05-07T19:50:34.8051896Z NVCC_FLAGS: 2025-05-07T19:50:34.8052013Z 2025-05-07T19:50:34.8052109Z 2025-05-07T19:50:34.8052295Z HIPCC_FLAGS: 2025-05-07T19:50:34.8052416Z 2025-05-07T19:50:34.8052510Z 2025-05-07T19:50:34.8052688Z INCLUDE_DIRS: 2025-05-07T19:50:34.8052936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8053244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8053541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8053845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8054347Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8055133Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8055766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8056186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8056601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8057082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8057582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8058046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8058611Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8059099Z 2025-05-07T19:50:34.8059314Z Selected Source Files: 2025-05-07T19:50:34.8059576Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:34.8060019Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:34.8060285Z 2025-05-07T19:50:34.8060495Z HIPified Source Files: 2025-05-07T19:50:34.8060639Z 2025-05-07T19:50:34.8060716Z 2025-05-07T19:50:34.8060922Z Library Dependencies: 
2025-05-07T19:50:34.8061159Z torch 2025-05-07T19:50:34.8061347Z torch_library 2025-05-07T19:50:34.8061768Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8062391Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8063054Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8063785Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8064482Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8064950Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8065287Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8065677Z 2025-05-07T19:50:34.8065860Z Output Library: 2025-05-07T19:50:34.8066270Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8066521Z 2025-05-07T19:50:34.8066781Z Destination Directory: 2025-05-07T19:50:34.8067026Z fbgemm_gpu 2025-05-07T19:50:34.8067247Z ================================================================================ 2025-05-07T19:50:34.8067493Z 2025-05-07T19:50:34.8067497Z 2025-05-07T19:50:34.8067501Z 2025-05-07T19:50:34.8067611Z ================================================================================ 2025-05-07T19:50:34.8067993Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:34.8068319Z 2025-05-07T19:50:34.8068529Z CPU_SRCS: 2025-05-07T19:50:34.8068860Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:34.8069299Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:34.8069701Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:34.8070016Z 2025-05-07T19:50:34.8070198Z GPU_SRCS: 2025-05-07T19:50:34.8070444Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:34.8070855Z codegen/utils/embedding_bounds_check_v2.cu 
2025-05-07T19:50:34.8071189Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:34.8071510Z 2025-05-07T19:50:34.8071706Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8071849Z 2025-05-07T19:50:34.8071962Z 2025-05-07T19:50:34.8072151Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8072314Z 2025-05-07T19:50:34.8072392Z 2025-05-07T19:50:34.8072576Z OTHER_SRCS: 2025-05-07T19:50:34.8072711Z 2025-05-07T19:50:34.8072792Z 2025-05-07T19:50:34.8072978Z CC_FLAGS: 2025-05-07T19:50:34.8073109Z 2025-05-07T19:50:34.8073189Z 2025-05-07T19:50:34.8073393Z NVCC_FLAGS: 2025-05-07T19:50:34.8073614Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8073910Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8074290Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8074796Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8075056Z 2025-05-07T19:50:34.8075332Z HIPCC_FLAGS: 2025-05-07T19:50:34.8075460Z 2025-05-07T19:50:34.8075552Z 2025-05-07T19:50:34.8075768Z INCLUDE_DIRS: 2025-05-07T19:50:34.8076013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8076354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8076671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8076986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8077507Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8078294Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8078968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8079391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8079849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8080338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8080852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8081334Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8081887Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8082403Z 2025-05-07T19:50:34.8082602Z Selected Source Files: 2025-05-07T19:50:34.8082909Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:34.8083327Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:34.8083741Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:34.8084106Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:34.8084451Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:34.8084810Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:34.8085099Z 2025-05-07T19:50:34.8085311Z HIPified Source Files: 2025-05-07T19:50:34.8085466Z 2025-05-07T19:50:34.8085547Z 2025-05-07T19:50:34.8085764Z Library Dependencies: 2025-05-07T19:50:34.8085994Z torch 2025-05-07T19:50:34.8086217Z torch_library 2025-05-07T19:50:34.8086651Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8087481Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8088192Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8088931Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8089635Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8090066Z fbgemm 2025-05-07T19:50:34.8090274Z fbgemm_gpu_config 2025-05-07T19:50:34.8090708Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8091298Z 2025-05-07T19:50:34.8091546Z Output Library: 2025-05-07T19:50:34.8091777Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8092024Z 2025-05-07T19:50:34.8092221Z Destination Directory: 2025-05-07T19:50:34.8092482Z fbgemm_gpu 
2025-05-07T19:50:34.8092780Z ================================================================================ 2025-05-07T19:50:34.8093035Z 2025-05-07T19:50:34.8093039Z 2025-05-07T19:50:34.8093042Z 2025-05-07T19:50:34.8093153Z ================================================================================ 2025-05-07T19:50:34.8093564Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:34.8093912Z 2025-05-07T19:50:34.8094126Z CPU_SRCS: 2025-05-07T19:50:34.8094245Z 2025-05-07T19:50:34.8094329Z 2025-05-07T19:50:34.8094536Z GPU_SRCS: 2025-05-07T19:50:34.8094783Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:34.8095197Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:34.8095608Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:34.8095962Z 2025-05-07T19:50:34.8096162Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8096322Z 2025-05-07T19:50:34.8096403Z 2025-05-07T19:50:34.8096627Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8096764Z 2025-05-07T19:50:34.8096846Z 2025-05-07T19:50:34.8097054Z OTHER_SRCS: 2025-05-07T19:50:34.8097170Z 2025-05-07T19:50:34.8097247Z 2025-05-07T19:50:34.8097457Z CC_FLAGS: 2025-05-07T19:50:34.8097569Z 2025-05-07T19:50:34.8097646Z 2025-05-07T19:50:34.8097851Z NVCC_FLAGS: 2025-05-07T19:50:34.8098064Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8098350Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8098623Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8098938Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8099202Z 2025-05-07T19:50:34.8099385Z HIPCC_FLAGS: 2025-05-07T19:50:34.8099511Z 2025-05-07T19:50:34.8099612Z 2025-05-07T19:50:34.8099794Z INCLUDE_DIRS: 2025-05-07T19:50:34.8100050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8100371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8100680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8100979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8101475Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8102242Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8126743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8127286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8127728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8128211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8129137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8129618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8130200Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8130734Z 2025-05-07T19:50:34.8130936Z Selected Source Files: 2025-05-07T19:50:34.8131253Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:34.8131673Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:34.8132094Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:34.8132459Z 2025-05-07T19:50:34.8132663Z HIPified Source Files: 2025-05-07T19:50:34.8132819Z 2025-05-07T19:50:34.8132925Z 2025-05-07T19:50:34.8133134Z Library Dependencies: 2025-05-07T19:50:34.8133390Z torch 2025-05-07T19:50:34.8133582Z torch_library 2025-05-07T19:50:34.8134038Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8134716Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8135616Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8136439Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8137181Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8137910Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8138322Z 2025-05-07T19:50:34.8138543Z Output Library: 2025-05-07T19:50:34.8138786Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:34.8139210Z 2025-05-07T19:50:34.8139431Z Destination Directory: 2025-05-07T19:50:34.8139669Z fbgemm_gpu 2025-05-07T19:50:34.8139922Z ================================================================================ 2025-05-07T19:50:34.8140155Z 2025-05-07T19:50:34.8140160Z 2025-05-07T19:50:34.8140165Z 2025-05-07T19:50:34.8140281Z ================================================================================ 2025-05-07T19:50:34.8140720Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:34.8141234Z 2025-05-07T19:50:34.8141447Z CPU_SRCS: 2025-05-07T19:50:34.8141712Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8142026Z 2025-05-07T19:50:34.8142229Z GPU_SRCS: 2025-05-07T19:50:34.8142470Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.8142968Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.8143317Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.8143702Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8144120Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8144532Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8144928Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.8145293Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.8145666Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.8146032Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8146462Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8146861Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 
2025-05-07T19:50:34.8147282Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8147678Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8148092Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8148515Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8148915Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8149322Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8149701Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8150099Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8150500Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8150923Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8151364Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8151760Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8152163Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.8152557Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8153006Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8153419Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.8153827Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8154358Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8154955Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8155377Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8155785Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8156197Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8156696Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8157163Z 
gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8157592Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8158008Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.8158496Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8158956Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8159412Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.8159823Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8160259Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8160667Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8161108Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8161540Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8161967Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8162404Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8162865Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8163332Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8163745Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8164134Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8164443Z 2025-05-07T19:50:34.8164656Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8164800Z 2025-05-07T19:50:34.8164882Z 2025-05-07T19:50:34.8165094Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8165232Z 2025-05-07T19:50:34.8165335Z 2025-05-07T19:50:34.8165556Z OTHER_SRCS: 2025-05-07T19:50:34.8165674Z 2025-05-07T19:50:34.8165770Z 2025-05-07T19:50:34.8165948Z CC_FLAGS: 2025-05-07T19:50:34.8166076Z 2025-05-07T19:50:34.8166156Z 2025-05-07T19:50:34.8166333Z NVCC_FLAGS: 2025-05-07T19:50:34.8166565Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8166954Z 
-D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8167244Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8167528Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8167787Z 2025-05-07T19:50:34.8167998Z HIPCC_FLAGS: 2025-05-07T19:50:34.8168119Z 2025-05-07T19:50:34.8168199Z 2025-05-07T19:50:34.8168403Z INCLUDE_DIRS: 2025-05-07T19:50:34.8168633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8168958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8169235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8169550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8170027Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8170811Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8171462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8171861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8172299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8172760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8173279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8173723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8174284Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8174794Z 2025-05-07T19:50:34.8174984Z Selected Source Files: 2025-05-07T19:50:34.8175282Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8175667Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8176083Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8176482Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8176898Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 
2025-05-07T19:50:34.8178679Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:34.8179088Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:34.8179470Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8179882Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8180440Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8180848Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:34.8181237Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8181578Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8181926Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:34.8182253Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:34.8182596Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:34.8182942Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8183337Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8183733Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:34.8184088Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:34.8184446Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:34.8184781Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:34.8185159Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8185534Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8185931Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8186303Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8186688Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:34.8187078Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:34.8187465Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8187875Z 
gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8188240Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:34.8188622Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8189027Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8189431Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:34.8189830Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8190202Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8190583Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8190955Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8191325Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:34.8191715Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8192299Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8192680Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:34.8193099Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8193563Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:34.8193989Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:34.8194704Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8195120Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8195551Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:34.8195972Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:34.8196389Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:34.8196822Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8197280Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8197740Z 
gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:34.8198095Z 2025-05-07T19:50:34.8198307Z HIPified Source Files: 2025-05-07T19:50:34.8198459Z 2025-05-07T19:50:34.8198536Z 2025-05-07T19:50:34.8198826Z Library Dependencies: 2025-05-07T19:50:34.8199058Z torch 2025-05-07T19:50:34.8199262Z torch_library 2025-05-07T19:50:34.8199691Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8200389Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8201171Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8201974Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8202709Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8203177Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8203530Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8203929Z 2025-05-07T19:50:34.8204101Z Output Library: 2025-05-07T19:50:34.8204329Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:34.8204568Z 2025-05-07T19:50:34.8204765Z Destination Directory: 2025-05-07T19:50:34.8205022Z fbgemm_gpu 2025-05-07T19:50:34.8205228Z ================================================================================ 2025-05-07T19:50:34.8205451Z 2025-05-07T19:50:34.8205455Z 2025-05-07T19:50:34.8205460Z 2025-05-07T19:50:34.8205575Z ================================================================================ 2025-05-07T19:50:34.8206001Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:34.8206387Z 2025-05-07T19:50:34.8206555Z CPU_SRCS: 2025-05-07T19:50:34.8206901Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8207257Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8207598Z 
gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8207913Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8208214Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8208536Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8208908Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8209321Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8209677Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:34.8210066Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8210489Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8210873Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8211351Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8211894Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8212430Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8212900Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8213309Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8213703Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8214133Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8214563Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8214938Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8215332Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8215728Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8216186Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8216805Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8217238Z 
gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8217687Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8218153Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8218617Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8219215Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8219824Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8220421Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8221021Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8221610Z 2025-05-07T19:50:34.8221789Z GPU_SRCS: 2025-05-07T19:50:34.8222053Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8222495Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8222962Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8223341Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8223728Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8224135Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8224595Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8225119Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8225566Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8226043Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8226538Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8227021Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 
2025-05-07T19:50:34.8227604Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8228238Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8229046Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8229624Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8230134Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8230489Z 2025-05-07T19:50:34.8230674Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8230818Z 2025-05-07T19:50:34.8230904Z 2025-05-07T19:50:34.8231073Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8231205Z 2025-05-07T19:50:34.8231287Z 2025-05-07T19:50:34.8231451Z OTHER_SRCS: 2025-05-07T19:50:34.8231563Z 2025-05-07T19:50:34.8231649Z 2025-05-07T19:50:34.8231816Z CC_FLAGS: 2025-05-07T19:50:34.8231931Z 2025-05-07T19:50:34.8231999Z 2025-05-07T19:50:34.8232161Z NVCC_FLAGS: 2025-05-07T19:50:34.8232366Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8232613Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8232881Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8233155Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8233381Z 2025-05-07T19:50:34.8233555Z HIPCC_FLAGS: 2025-05-07T19:50:34.8233673Z 2025-05-07T19:50:34.8233743Z 2025-05-07T19:50:34.8233931Z INCLUDE_DIRS: 2025-05-07T19:50:34.8234221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8234710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8234970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8235285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8235760Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8236551Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8237210Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8237626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8238066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8238531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8239060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8239834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8240418Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8240944Z 2025-05-07T19:50:34.8241144Z Selected Source Files: 2025-05-07T19:50:34.8241449Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8241892Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8242285Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8242618Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8242980Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8243321Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8243732Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:34.8244187Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:34.8244570Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:34.8244994Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8245433Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:34.8245863Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8246361Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:34.8247059Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8247608Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:34.8248087Z 
gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8248507Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:34.8248891Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8249345Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8249771Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8250161Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8250533Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8250933Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8251387Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8251875Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8252329Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8252781Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8253280Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8253739Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8254302Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8254931Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8255541Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8256108Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:34.8256571Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8257019Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8257442Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8257842Z 
gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8258236Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8258627Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8259099Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8259598Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8260055Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8260569Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8261085Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8261575Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8262131Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8262831Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8263445Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8264019Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8264537Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:34.8264896Z 2025-05-07T19:50:34.8265104Z HIPified Source Files: 2025-05-07T19:50:34.8265250Z 2025-05-07T19:50:34.8265327Z 2025-05-07T19:50:34.8265536Z Library Dependencies: 2025-05-07T19:50:34.8265749Z torch 2025-05-07T19:50:34.8265947Z torch_library 2025-05-07T19:50:34.8266351Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8266992Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8267630Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:34.8268386Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8269085Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8269515Z fbgemm 2025-05-07T19:50:34.8269723Z fbgemm_gpu_config 2025-05-07T19:50:34.8269931Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8270166Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8270385Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8270632Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8270997Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8271388Z 2025-05-07T19:50:34.8271589Z Output Library: 2025-05-07T19:50:34.8271808Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:34.8272088Z 2025-05-07T19:50:34.8272274Z Destination Directory: 2025-05-07T19:50:34.8272505Z fbgemm_gpu 2025-05-07T19:50:34.8272729Z ================================================================================ 2025-05-07T19:50:34.8272968Z 2025-05-07T19:50:34.8272972Z 2025-05-07T19:50:34.8272976Z 2025-05-07T19:50:34.8273085Z ================================================================================ 2025-05-07T19:50:34.8273487Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:34.8273837Z 2025-05-07T19:50:34.8274105Z CPU_SRCS: 2025-05-07T19:50:34.8274578Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:34.8275038Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:34.8275389Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:34.8275787Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8276162Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:34.8276517Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:34.8276870Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:34.8277215Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:34.8277640Z 
gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:34.8278069Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:34.8278470Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:34.8278875Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8279323Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:34.8279746Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8280257Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8280825Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8281482Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8281982Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:34.8282397Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8282776Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8283193Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:34.8283478Z 2025-05-07T19:50:34.8283648Z GPU_SRCS: 2025-05-07T19:50:34.8283895Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:34.8284308Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8284757Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8285199Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8285630Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:34.8286103Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:34.8286592Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:34.8287182Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8287674Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:34.8288181Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8288654Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:34.8289090Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8289557Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8289971Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8290364Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8290781Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8291209Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8291664Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8292128Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8292560Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8292955Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8293396Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8293824Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8294260Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8294730Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8295201Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8295706Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8296228Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8296719Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8297184Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 
2025-05-07T19:50:34.8297666Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8298083Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8298439Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8298821Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8299206Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8299627Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8300070Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8300465Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8300836Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8301289Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8301670Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8302033Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8302426Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8302871Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8303302Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8303756Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8304156Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8304542Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8304942Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8305325Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8305687Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8306074Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8306467Z 
gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8306879Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8307326Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8307728Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8308108Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8308508Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8308898Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8309288Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8309706Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8310137Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8310587Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8311072Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8311760Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8312173Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8312607Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8313045Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8313528Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8314119Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8314825Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8315392Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8316003Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8316577Z 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8317113Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8317685Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8318214Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8318745Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8319290Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8319846Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8320431Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8321032Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8321668Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8322194Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8322756Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8323221Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:34.8323682Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8324104Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8324528Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8324988Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8325470Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8325919Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:34.8326329Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8326892Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:34.8327360Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:34.8327888Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8328590Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8329355Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8329974Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8330617Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8331223Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:34.8331802Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8332401Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8332843Z 2025-05-07T19:50:34.8333018Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8333164Z 2025-05-07T19:50:34.8333234Z 2025-05-07T19:50:34.8333406Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8333734Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:34.8334212Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:34.8334806Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:34.8335160Z 2025-05-07T19:50:34.8335330Z OTHER_SRCS: 2025-05-07T19:50:34.8335441Z 2025-05-07T19:50:34.8335528Z 2025-05-07T19:50:34.8335694Z CC_FLAGS: 2025-05-07T19:50:34.8335810Z 2025-05-07T19:50:34.8335882Z 2025-05-07T19:50:34.8336045Z NVCC_FLAGS: 2025-05-07T19:50:34.8336255Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8336503Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8336769Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8337045Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8337272Z 
2025-05-07T19:50:34.8337455Z HIPCC_FLAGS: 2025-05-07T19:50:34.8337564Z 2025-05-07T19:50:34.8337634Z 2025-05-07T19:50:34.8337807Z INCLUDE_DIRS: 2025-05-07T19:50:34.8338018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8338310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8338566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8338861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8339317Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8340070Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8340688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8341164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8341554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8341970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8342549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8342952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8343463Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8343920Z 2025-05-07T19:50:34.8344161Z Selected Source Files: 2025-05-07T19:50:34.8344491Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:34.8344883Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:34.8345203Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:34.8345535Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8345882Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:34.8346174Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:34.8346478Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:34.8346787Z 
gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:34.8347135Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:34.8347531Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:34.8347874Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:34.8348245Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8348629Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:34.8349008Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:34.8349464Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:34.8349979Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8350499Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:34.8350956Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:34.8351340Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8351671Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:34.8352018Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:34.8352331Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:34.8352714Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8353132Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8353533Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:34.8353936Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:34.8354597Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:34.8355093Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:34.8355578Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8356107Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8356667Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8357172Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:34.8357659Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8358159Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8358608Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8359017Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8359465Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8359913Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8360383Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8360886Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8361337Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8361766Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8362307Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8362769Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:34.8363254Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8363753Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8364336Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8364871Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8365456Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8365978Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:34.8366474Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8367095Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8367503Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8367875Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8368248Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8368641Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8369050Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8369501Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8369904Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8370272Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8370670Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8371033Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8371405Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8371781Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8372170Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8372591Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8373046Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8373458Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8373827Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8374242Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8374613Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8374982Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8375367Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 
2025-05-07T19:50:34.8375761Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8376181Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8376624Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8377033Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8377401Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8377807Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8378190Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:34.8378584Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8379005Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8379425Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8379873Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8380344Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8380776Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:34.8381172Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8381615Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8382124Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:34.8382600Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8383111Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8383673Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8384214Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8384768Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 
2025-05-07T19:50:34.8385291Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:34.8385788Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8386303Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8386803Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:34.8387280Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8387792Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8388297Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8388837Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8389400Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8389920Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:34.8390416Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8390926Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8391364Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:34.8391722Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8392120Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8392520Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8392941Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8393394Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8393795Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:34.8394258Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8394852Z 
gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8395358Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:34.8395936Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8396527Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8397149Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8397781Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8398450Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8399064Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:34.8399660Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8400275Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8400704Z 2025-05-07T19:50:34.8400890Z HIPified Source Files: 2025-05-07T19:50:34.8401034Z 2025-05-07T19:50:34.8401102Z 2025-05-07T19:50:34.8401298Z Library Dependencies: 2025-05-07T19:50:34.8401505Z torch 2025-05-07T19:50:34.8401694Z torch_library 2025-05-07T19:50:34.8402118Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8402853Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8403536Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8404306Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8405097Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8405542Z fbgemm 2025-05-07T19:50:34.8405735Z 
fbgemm_gpu_config 2025-05-07T19:50:34.8405949Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8406181Z fbgemm_gpu_tbe_common 2025-05-07T19:50:34.8406408Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8406637Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8407115Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8407467Z 2025-05-07T19:50:34.8407645Z Output Library: 2025-05-07T19:50:34.8407847Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8408077Z 2025-05-07T19:50:34.8408252Z Destination Directory: 2025-05-07T19:50:34.8408458Z fbgemm_gpu 2025-05-07T19:50:34.8408656Z ================================================================================ 2025-05-07T19:50:34.8408866Z 2025-05-07T19:50:34.8408869Z 2025-05-07T19:50:34.8408873Z 2025-05-07T19:50:34.8408973Z ================================================================================ 2025-05-07T19:50:34.8409364Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:34.8409710Z 2025-05-07T19:50:34.8409880Z CPU_SRCS: 2025-05-07T19:50:34.8409978Z 2025-05-07T19:50:34.8410042Z 2025-05-07T19:50:34.8410207Z GPU_SRCS: 2025-05-07T19:50:34.8410478Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:34.8410954Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8411461Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8411941Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:34.8412436Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8412942Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8413437Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8413939Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8414461Z 
gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8414981Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8415492Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8416038Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8416425Z 2025-05-07T19:50:34.8416600Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8416724Z 2025-05-07T19:50:34.8416792Z 2025-05-07T19:50:34.8416971Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8417098Z 2025-05-07T19:50:34.8417171Z 2025-05-07T19:50:34.8417332Z OTHER_SRCS: 2025-05-07T19:50:34.8417435Z 2025-05-07T19:50:34.8417516Z 2025-05-07T19:50:34.8417673Z CC_FLAGS: 2025-05-07T19:50:34.8417770Z 2025-05-07T19:50:34.8417844Z 2025-05-07T19:50:34.8417994Z NVCC_FLAGS: 2025-05-07T19:50:34.8418199Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8418430Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8418681Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8418938Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8419164Z 2025-05-07T19:50:34.8419323Z HIPCC_FLAGS: 2025-05-07T19:50:34.8419437Z 2025-05-07T19:50:34.8419508Z 2025-05-07T19:50:34.8419684Z INCLUDE_DIRS: 2025-05-07T19:50:34.8419887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8420167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8420417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8420689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8421188Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8421914Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8422493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8423557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8423948Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8424360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8424828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8425239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8425748Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8426198Z 2025-05-07T19:50:34.8426382Z Selected Source Files: 2025-05-07T19:50:34.8426699Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:34.8427165Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8427668Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8428146Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:34.8428978Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:34.8429512Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:34.8430041Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8430566Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8431115Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8431656Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:34.8432193Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:34.8432428Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:34.8432499Z 2025-05-07T19:50:34.8432590Z HIPified Source Files: 2025-05-07T19:50:34.8432594Z 2025-05-07T19:50:34.8432660Z 2025-05-07T19:50:34.8432755Z Library Dependencies: 2025-05-07T19:50:34.8432830Z torch 
2025-05-07T19:50:34.8432905Z torch_library 2025-05-07T19:50:34.8433208Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8433450Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8433763Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8434183Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8434445Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8434720Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8434926Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8435012Z 2025-05-07T19:50:34.8435095Z Output Library: 2025-05-07T19:50:34.8435194Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:34.8435277Z 2025-05-07T19:50:34.8435371Z Destination Directory: 2025-05-07T19:50:34.8435468Z fbgemm_gpu 2025-05-07T19:50:34.8435578Z ================================================================================ 2025-05-07T19:50:34.8435582Z 2025-05-07T19:50:34.8435596Z 2025-05-07T19:50:34.8435600Z 2025-05-07T19:50:34.8435704Z ================================================================================ 2025-05-07T19:50:34.8435900Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:34.8435970Z 2025-05-07T19:50:34.8436055Z CPU_SRCS: 2025-05-07T19:50:34.8436060Z 2025-05-07T19:50:34.8436130Z 2025-05-07T19:50:34.8436205Z GPU_SRCS: 2025-05-07T19:50:34.8436530Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8436715Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8436914Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8437112Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8437612Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8437859Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8438005Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8438167Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8438316Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8438477Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8438635Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8438797Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8438981Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8439196Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8439412Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8439597Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8439801Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8440019Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8440212Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8440429Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8440660Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8440848Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8441062Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8441285Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:50:34.8441516Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8441776Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8442037Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8442275Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8442534Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8442795Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8442945Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8443109Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8443273Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8443426Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8443598Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8443778Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8443935Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8444108Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8444289Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8444448Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8444642Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8444828Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8444974Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8445207Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8445384Z 
gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8445558Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8445760Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8445998Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8446082Z 2025-05-07T19:50:34.8446177Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8446197Z 2025-05-07T19:50:34.8446278Z 2025-05-07T19:50:34.8446373Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8446377Z 2025-05-07T19:50:34.8446457Z 2025-05-07T19:50:34.8446555Z OTHER_SRCS: 2025-05-07T19:50:34.8446559Z 2025-05-07T19:50:34.8446643Z 2025-05-07T19:50:34.8446724Z CC_FLAGS: 2025-05-07T19:50:34.8446842Z 2025-05-07T19:50:34.8446928Z 2025-05-07T19:50:34.8447004Z NVCC_FLAGS: 2025-05-07T19:50:34.8447107Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8447213Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8447333Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8447426Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8447497Z 2025-05-07T19:50:34.8447588Z HIPCC_FLAGS: 2025-05-07T19:50:34.8447592Z 2025-05-07T19:50:34.8447664Z 2025-05-07T19:50:34.8447744Z INCLUDE_DIRS: 2025-05-07T19:50:34.8447849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8447964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8448064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8448163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8448443Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8448803Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8448939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8449109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8449259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
2025-05-07T19:50:34.8449450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8449638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8449791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8450081Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8450159Z 2025-05-07T19:50:34.8450268Z Selected Source Files: 2025-05-07T19:50:34.8450453Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8450630Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8450839Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8451022Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8451261Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8451495Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8451655Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8451804Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8451954Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8452128Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8452274Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:34.8452425Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:34.8452622Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8452824Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8453028Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8453251Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 
2025-05-07T19:50:34.8453459Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8453653Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8453836Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8454105Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8454312Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8454491Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8454708Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8454910Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8455129Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8455391Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8455634Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8455862Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8456112Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8456377Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8456513Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8456673Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8456847Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8456988Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8457153Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8457336Z 
gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8457477Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8457642Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8457809Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8457972Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8458149Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8458326Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8458482Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:34.8458643Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8458807Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8458970Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:34.8459140Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:34.8459319Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:34.8459391Z 2025-05-07T19:50:34.8459493Z HIPified Source Files: 2025-05-07T19:50:34.8459497Z 2025-05-07T19:50:34.8459568Z 2025-05-07T19:50:34.8459657Z Library Dependencies: 2025-05-07T19:50:34.8459744Z torch 2025-05-07T19:50:34.8459826Z torch_library 2025-05-07T19:50:34.8460115Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8460362Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8460667Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8460989Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8461239Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:34.8461351Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8461596Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8461668Z 2025-05-07T19:50:34.8461764Z Output Library: 2025-05-07T19:50:34.8461864Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:34.8461936Z 2025-05-07T19:50:34.8462025Z Destination Directory: 2025-05-07T19:50:34.8462116Z fbgemm_gpu 2025-05-07T19:50:34.8462346Z ================================================================================ 2025-05-07T19:50:34.8462350Z 2025-05-07T19:50:34.8462353Z 2025-05-07T19:50:34.8462357Z 2025-05-07T19:50:34.8462464Z ================================================================================ 2025-05-07T19:50:34.8462680Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:34.8462754Z 2025-05-07T19:50:34.8462834Z CPU_SRCS: 2025-05-07T19:50:34.8462838Z 2025-05-07T19:50:34.8462927Z 2025-05-07T19:50:34.8463003Z GPU_SRCS: 2025-05-07T19:50:34.8463135Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:34.8463293Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:34.8463447Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8463602Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8463760Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8463933Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:34.8464111Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8464290Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8464445Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:34.8464586Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:34.8464744Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8464920Z 
gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8465025Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:34.8465095Z 2025-05-07T19:50:34.8465181Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8465186Z 2025-05-07T19:50:34.8465274Z 2025-05-07T19:50:34.8465360Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8465364Z 2025-05-07T19:50:34.8465435Z 2025-05-07T19:50:34.8465529Z OTHER_SRCS: 2025-05-07T19:50:34.8465532Z 2025-05-07T19:50:34.8465602Z 2025-05-07T19:50:34.8465680Z CC_FLAGS: 2025-05-07T19:50:34.8465685Z 2025-05-07T19:50:34.8465756Z 2025-05-07T19:50:34.8465851Z NVCC_FLAGS: 2025-05-07T19:50:34.8465946Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8466038Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8466148Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8466240Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8466310Z 2025-05-07T19:50:34.8466389Z HIPCC_FLAGS: 2025-05-07T19:50:34.8466408Z 2025-05-07T19:50:34.8466477Z 2025-05-07T19:50:34.8466548Z INCLUDE_DIRS: 2025-05-07T19:50:34.8466642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8466739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8466836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8466933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8467206Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8467563Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8467691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8467833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8467985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8468172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8468351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
2025-05-07T19:50:34.8468490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8468773Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8468839Z 2025-05-07T19:50:34.8468984Z Selected Source Files: 2025-05-07T19:50:34.8469120Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:34.8469277Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:34.8469414Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:34.8469574Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:34.8469700Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:34.8469842Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:34.8470001Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:34.8470157Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:34.8470330Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:34.8470520Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:34.8470655Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:34.8470813Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:34.8470968Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:34.8471055Z 2025-05-07T19:50:34.8471141Z HIPified Source Files: 2025-05-07T19:50:34.8471145Z 2025-05-07T19:50:34.8471212Z 2025-05-07T19:50:34.8471311Z Library Dependencies: 2025-05-07T19:50:34.8471384Z torch 2025-05-07T19:50:34.8471457Z torch_library 2025-05-07T19:50:34.8471742Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8471981Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8472279Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8472592Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8472853Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8472949Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:34.8473140Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8473215Z 2025-05-07T19:50:34.8473292Z Output Library: 2025-05-07T19:50:34.8473391Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:34.8473459Z 2025-05-07T19:50:34.8473556Z Destination Directory: 2025-05-07T19:50:34.8473635Z fbgemm_gpu 2025-05-07T19:50:34.8473743Z ================================================================================ 2025-05-07T19:50:34.8473747Z 2025-05-07T19:50:34.8473751Z 2025-05-07T19:50:34.8473754Z 2025-05-07T19:50:34.8473869Z ================================================================================ 2025-05-07T19:50:34.8474164Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:34.8474236Z 2025-05-07T19:50:34.8474324Z CPU_SRCS: 2025-05-07T19:50:34.8474328Z 2025-05-07T19:50:34.8474404Z 2025-05-07T19:50:34.8474654Z GPU_SRCS: 2025-05-07T19:50:34.8474775Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:34.8474934Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:34.8475041Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:34.8475188Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:34.8475314Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:34.8487265Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:34.8487458Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:34.8487593Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:34.8487702Z gen_embedding_backward_split_none.cpp 
2025-05-07T19:50:34.8487869Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8487982Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:34.8488120Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:34.8488317Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8488652Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8488836Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8488999Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:34.8489119Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:34.8489322Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8489476Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8489639Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8489808Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8489932Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8490075Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8490197Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8490331Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8490466Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8490595Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8490730Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8490886Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8491068Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8491252Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8491430Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8491620Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8491744Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:34.8491878Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:34.8492096Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:34.8492313Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:34.8492381Z 2025-05-07T19:50:34.8492468Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8492474Z 2025-05-07T19:50:34.8492542Z 2025-05-07T19:50:34.8492622Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8492626Z 2025-05-07T19:50:34.8492695Z 2025-05-07T19:50:34.8492781Z OTHER_SRCS: 2025-05-07T19:50:34.8492786Z 2025-05-07T19:50:34.8492850Z 2025-05-07T19:50:34.8492918Z CC_FLAGS: 2025-05-07T19:50:34.8492922Z 2025-05-07T19:50:34.8492996Z 2025-05-07T19:50:34.8493071Z NVCC_FLAGS: 2025-05-07T19:50:34.8493161Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8493250Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8493352Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8493432Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8493497Z 2025-05-07T19:50:34.8493577Z HIPCC_FLAGS: 2025-05-07T19:50:34.8493581Z 2025-05-07T19:50:34.8493650Z 2025-05-07T19:50:34.8493720Z INCLUDE_DIRS: 2025-05-07T19:50:34.8493814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8493913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8494006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8494095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8494368Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8494732Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8494860Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8495015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8495158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8495341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8495524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8495656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8495985Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8496055Z 2025-05-07T19:50:34.8496142Z Selected Source Files: 2025-05-07T19:50:34.8496242Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:34.8496359Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:34.8496514Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:34.8496605Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:34.8496698Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:34.8496797Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:34.8496938Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:34.8497068Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:34.8497160Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:34.8497331Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8497440Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:34.8497579Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:34.8497765Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:34.8497975Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8498151Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:34.8498300Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 
2025-05-07T19:50:34.8498421Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:34.8498556Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8498700Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8498877Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:34.8499050Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:34.8499174Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8499303Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8499442Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8499576Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8499698Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8499841Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8499979Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:34.8500121Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:34.8500313Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:34.8500501Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:34.8500681Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:34.8500874Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:34.8501003Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:34.8501137Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:34.8501341Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:34.8501570Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:34.8501638Z 2025-05-07T19:50:34.8501721Z HIPified Source Files: 2025-05-07T19:50:34.8501728Z 
2025-05-07T19:50:34.8501808Z 2025-05-07T19:50:34.8501890Z Library Dependencies: 2025-05-07T19:50:34.8501954Z torch 2025-05-07T19:50:34.8502024Z torch_library 2025-05-07T19:50:34.8502314Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8502544Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8502842Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8503168Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8503483Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8503564Z fbgemm_gpu_config 2025-05-07T19:50:34.8503648Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8503840Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8503957Z 2025-05-07T19:50:34.8504034Z Output Library: 2025-05-07T19:50:34.8504150Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:34.8504213Z 2025-05-07T19:50:34.8504292Z Destination Directory: 2025-05-07T19:50:34.8504373Z fbgemm_gpu 2025-05-07T19:50:34.8504472Z ================================================================================ 2025-05-07T19:50:34.8504476Z 2025-05-07T19:50:34.8504480Z 2025-05-07T19:50:34.8504483Z 2025-05-07T19:50:34.8504576Z ================================================================================ 2025-05-07T19:50:34.8504738Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:34.8504801Z 2025-05-07T19:50:34.8504869Z CPU_SRCS: 2025-05-07T19:50:34.8505061Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:34.8505238Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:34.8505300Z 2025-05-07T19:50:34.8505370Z GPU_SRCS: 2025-05-07T19:50:34.8505550Z 
codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:34.8505673Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.8505785Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.8505904Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.8506037Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.8506151Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.8506268Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.8506400Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.8506464Z 2025-05-07T19:50:34.8506545Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8506549Z 2025-05-07T19:50:34.8506633Z 2025-05-07T19:50:34.8506711Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8506715Z 2025-05-07T19:50:34.8506781Z 2025-05-07T19:50:34.8506848Z OTHER_SRCS: 2025-05-07T19:50:34.8506852Z 2025-05-07T19:50:34.8506929Z 2025-05-07T19:50:34.8507000Z CC_FLAGS: 2025-05-07T19:50:34.8507003Z 2025-05-07T19:50:34.8507070Z 2025-05-07T19:50:34.8507149Z NVCC_FLAGS: 2025-05-07T19:50:34.8507235Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8507320Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8507408Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8507505Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8507568Z 2025-05-07T19:50:34.8507640Z HIPCC_FLAGS: 2025-05-07T19:50:34.8507644Z 2025-05-07T19:50:34.8507720Z 2025-05-07T19:50:34.8507793Z INCLUDE_DIRS: 2025-05-07T19:50:34.8507889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8507978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8508084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8508183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8508442Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8508809Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8508942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8509093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8509247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8509437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8509621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8509753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8510047Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8510119Z 2025-05-07T19:50:34.8510204Z Selected Source Files: 2025-05-07T19:50:34.8510454Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:34.8510629Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:34.8510805Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:34.8510934Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:34.8511097Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:34.8511223Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:34.8511353Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:34.8511484Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:34.8511607Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:34.8511730Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:34.8511812Z 2025-05-07T19:50:34.8511898Z HIPified Source Files: 2025-05-07T19:50:34.8511902Z 2025-05-07T19:50:34.8511974Z 2025-05-07T19:50:34.8512061Z Library Dependencies: 2025-05-07T19:50:34.8512147Z torch 2025-05-07T19:50:34.8512224Z torch_library 2025-05-07T19:50:34.8512511Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8512755Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8513057Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8513375Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8513627Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8513717Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8513792Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8513982Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8514169Z 2025-05-07T19:50:34.8514250Z Output Library: 2025-05-07T19:50:34.8514334Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:34.8514581Z 2025-05-07T19:50:34.8514676Z Destination Directory: 2025-05-07T19:50:34.8514761Z fbgemm_gpu 2025-05-07T19:50:34.8514867Z ================================================================================ 2025-05-07T19:50:34.8514883Z 2025-05-07T19:50:34.8514887Z 2025-05-07T19:50:34.8514891Z 2025-05-07T19:50:34.8514998Z ================================================================================ 2025-05-07T19:50:34.8515185Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:34.8515256Z 2025-05-07T19:50:34.8515344Z CPU_SRCS: 2025-05-07T19:50:34.8515518Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:34.8515592Z 2025-05-07T19:50:34.8515674Z GPU_SRCS: 2025-05-07T19:50:34.8515841Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:34.8515994Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:34.8516063Z 2025-05-07T19:50:34.8516157Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8516162Z 2025-05-07T19:50:34.8516240Z 2025-05-07T19:50:34.8516324Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:50:34.8516328Z 2025-05-07T19:50:34.8516405Z 2025-05-07T19:50:34.8516480Z OTHER_SRCS: 2025-05-07T19:50:34.8516485Z 2025-05-07T19:50:34.8516555Z 2025-05-07T19:50:34.8516639Z CC_FLAGS: 2025-05-07T19:50:34.8516643Z 2025-05-07T19:50:34.8516721Z 2025-05-07T19:50:34.8516796Z NVCC_FLAGS: 2025-05-07T19:50:34.8516891Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8516995Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8517091Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8517184Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8517266Z 2025-05-07T19:50:34.8517343Z HIPCC_FLAGS: 2025-05-07T19:50:34.8517347Z 2025-05-07T19:50:34.8517420Z 2025-05-07T19:50:34.8517499Z INCLUDE_DIRS: 2025-05-07T19:50:34.8517611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8517703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8517806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8517979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8518254Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8518640Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8518836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8518989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8519139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8519332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8519537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8519675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8519975Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8520054Z 2025-05-07T19:50:34.8520141Z Selected Source Files: 2025-05-07T19:50:34.8520310Z 
src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:34.8520471Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:34.8520621Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:34.8520694Z 2025-05-07T19:50:34.8520782Z HIPified Source Files: 2025-05-07T19:50:34.8520786Z 2025-05-07T19:50:34.8520869Z 2025-05-07T19:50:34.8520952Z Library Dependencies: 2025-05-07T19:50:34.8521022Z torch 2025-05-07T19:50:34.8521104Z torch_library 2025-05-07T19:50:34.8521402Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8521645Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8521976Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8522314Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8522581Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8522785Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8522868Z 2025-05-07T19:50:34.8522947Z Output Library: 2025-05-07T19:50:34.8523046Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:34.8523127Z 2025-05-07T19:50:34.8523214Z Destination Directory: 2025-05-07T19:50:34.8523286Z fbgemm_gpu 2025-05-07T19:50:34.8523390Z ================================================================================ 2025-05-07T19:50:34.8523394Z 2025-05-07T19:50:34.8523408Z 2025-05-07T19:50:34.8523412Z 2025-05-07T19:50:34.8523515Z ================================================================================ 2025-05-07T19:50:34.8523640Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:34.8523705Z 2025-05-07T19:50:34.8523784Z CPU_SRCS: 2025-05-07T19:50:34.8523881Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:34.8523982Z 
src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:34.8524187Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8524393Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:34.8524592Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8524813Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:34.8525018Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8525245Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:34.8525387Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:34.8525523Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:34.8525646Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:34.8525759Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:34.8525910Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:34.8526063Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:34.8526165Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:34.8526291Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:34.8526398Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:34.8526493Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:34.8526632Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:34.8526837Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:34.8526927Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:34.8527015Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:34.8527111Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:34.8527206Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:34.8527417Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:34.8527550Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:34.8527753Z 
src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8527967Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:34.8528062Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:34.8528160Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:34.8528247Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:34.8528520Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:34.8528701Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8528794Z src/topology_utils.cpp 2025-05-07T19:50:34.8528867Z 2025-05-07T19:50:34.8528937Z GPU_SRCS: 2025-05-07T19:50:34.8529048Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:34.8529146Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:34.8529342Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:34.8529430Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:34.8529532Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:34.8529707Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:34.8529878Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:34.8530005Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:34.8530124Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:34.8530353Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:34.8530531Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:34.8530688Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:34.8530815Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:34.8530949Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:34.8531076Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:34.8531194Z src/jagged_tensor_ops/jagged_softmax_backward.cu 
2025-05-07T19:50:34.8531305Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:34.8531414Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:34.8531561Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:34.8531693Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:34.8531811Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:34.8531949Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:34.8532069Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:34.8532157Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:34.8532360Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:34.8532532Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:34.8532697Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:34.8532798Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:34.8532895Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:34.8533013Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:34.8533135Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:34.8533343Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:34.8533429Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:34.8533537Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:34.8533634Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:34.8533745Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:34.8533937Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:34.8534052Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:34.8534171Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:34.8534295Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:34.8534419Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:34.8534522Z src/sparse_ops/sparse_group_index.cu 
2025-05-07T19:50:34.8534610Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:34.8534699Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:34.8534802Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:34.8534920Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:34.8535029Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:34.8535122Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:34.8535217Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:34.8535304Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:34.8535422Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:34.8535514Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:34.8535613Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:34.8535726Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:34.8535817Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:34.8535884Z 2025-05-07T19:50:34.8535961Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:34.8535965Z 2025-05-07T19:50:34.8536038Z 2025-05-07T19:50:34.8536118Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:34.8536122Z 2025-05-07T19:50:34.8536191Z 2025-05-07T19:50:34.8536273Z OTHER_SRCS: 2025-05-07T19:50:34.8536277Z 2025-05-07T19:50:34.8536344Z 2025-05-07T19:50:34.8536411Z CC_FLAGS: 2025-05-07T19:50:34.8536419Z 2025-05-07T19:50:34.8536482Z 2025-05-07T19:50:34.8536560Z NVCC_FLAGS: 2025-05-07T19:50:34.8536648Z --expt-relaxed-constexpr 2025-05-07T19:50:34.8536737Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:34.8536836Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:34.8536930Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:34.8536997Z 2025-05-07T19:50:34.8537066Z HIPCC_FLAGS: 2025-05-07T19:50:34.8537070Z 2025-05-07T19:50:34.8537141Z 2025-05-07T19:50:34.8537214Z INCLUDE_DIRS: 2025-05-07T19:50:34.8537309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8537399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:34.8537489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:34.8537578Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:34.8537836Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:34.8538203Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:34.8538330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:34.8538472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:34.8538620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:34.8538951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:34.8539130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:34.8539266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:34.8539546Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:34.8539610Z 2025-05-07T19:50:34.8539698Z Selected Source Files: 2025-05-07T19:50:34.8539789Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:34.8539879Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:34.8540059Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8540501Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:34.8540685Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8540880Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:34.8541081Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:34.8541350Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:34.8541485Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:34.8541616Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:34.8541729Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 
2025-05-07T19:50:34.8541833Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:34.8541965Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:34.8542069Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:34.8542164Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:34.8542282Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:34.8542383Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:34.8542470Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:34.8542551Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:34.8542631Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:34.8542732Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:34.8542826Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:34.8542917Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:34.8543019Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:34.8543229Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:34.8543360Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:34.8543552Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8543776Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:34.8543867Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:34.8543956Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:34.8544057Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:34.8544159Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:34.8544334Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:34.8544428Z src/topology_utils.cpp 2025-05-07T19:50:34.8544530Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:34.8544620Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:34.8544812Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:34.8544904Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:34.8544992Z 
src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:34.8545165Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:34.8545340Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:34.8545454Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:34.8545572Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:34.8545815Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:34.8545977Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:34.8546141Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:34.8546273Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:34.8546411Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:34.8546532Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:34.8546645Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:34.8546765Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:34.8546868Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:34.8547013Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:34.8547150Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:34.8547268Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:34.8547454Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:34.8547570Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:34.8547667Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:34.8547863Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:34.8548083Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:34.8548257Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:34.8548350Z src/quantize_ops/quantize_bfloat16.cu 
2025-05-07T19:50:34.8548448Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:34.8548560Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:34.8548688Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:34.8548773Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:34.8548858Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:34.8548976Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:34.8549066Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:34.8549173Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:34.8549293Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:34.8549403Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:34.8549520Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:34.8549649Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:34.8549780Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:34.8549871Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:34.8549964Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:34.8550067Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:34.8550159Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:34.8550272Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:34.8550382Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:34.8550485Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:34.8550579Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:34.8550667Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:34.8550784Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:34.8550871Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:34.8550971Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:34.8551072Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:34.8551165Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:34.8551229Z 2025-05-07T19:50:34.8551304Z HIPified Source Files: 
2025-05-07T19:50:34.8551308Z 2025-05-07T19:50:34.8551385Z 2025-05-07T19:50:34.8551468Z Library Dependencies: 2025-05-07T19:50:34.8551530Z torch 2025-05-07T19:50:34.8551607Z torch_library 2025-05-07T19:50:34.8551899Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:34.8552122Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:34.8552424Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:34.8552745Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:34.8552987Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:34.8553051Z fbgemm 2025-05-07T19:50:34.8553152Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:34.8553242Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:34.8553321Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:34.8553395Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:34.8553484Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:34.8553555Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:34.8553747Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:34.8553816Z 2025-05-07T19:50:34.8553887Z Output Library: 2025-05-07T19:50:34.8553956Z fbgemm_gpu_py 2025-05-07T19:50:34.8554098Z 2025-05-07T19:50:34.8554188Z Destination Directory: 2025-05-07T19:50:34.8554258Z fbgemm_gpu 2025-05-07T19:50:34.8554575Z ================================================================================ 2025-05-07T19:50:34.8554581Z 2025-05-07T19:50:34.8554678Z -- Configuring done (9.1s) 2025-05-07T19:50:34.9805168Z -- Generating done (0.2s) 2025-05-07T19:50:34.9827244Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:50:34.9994715Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build' 
2025-05-07T19:50:34.9994758Z 2025-05-07T19:50:34.9995661Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:35.1248691Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:35.1260225Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1455015Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:35.1466918Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1500920Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:35.1515351Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1582799Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:35.1594573Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1706880Z 
[5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:35.1718426Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1729573Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:35.1741284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1774127Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:35.1786415Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1798088Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:35.1810144Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1821887Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:35.1834293Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.1956097Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:35.1967438Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2110224Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:35.2121782Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:50:35.2266699Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:35.2273149Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2412948Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:35.2424724Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2482970Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:35.2494767Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2505690Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:35.2517072Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2546468Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:35.2557689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2626674Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:35.2638462Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2687318Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:35.2699126Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.2883366Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:35.2895013Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3083047Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:35.3094587Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3478696Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:35.3490383Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3501882Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:35.3513588Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:35.3526269Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:35.3538643Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3564175Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:35.3575904Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3587628Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:35.3598956Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3763099Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:35.3775139Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3910724Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:35.3922446Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3954126Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:35.3965883Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.3980575Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:35.3992146Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4090210Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:35.4101428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4136560Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:35.4148794Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4160691Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:35.4173092Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4261684Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:35.4273643Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:35.4355114Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:35.4367533Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4497627Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:35.4509758Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4615800Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:35.4627816Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4807197Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:35.4819926Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.4956617Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:35.4969366Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5140164Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:35.5152657Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5271054Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:35.5283406Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5362801Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:35.5375174Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5515595Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:35.5527626Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5539030Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:35.5551251Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.5865151Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:35.5877688Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.6451363Z 
[45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:35.6457717Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.6876601Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:35.6883073Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7458943Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:35.7465254Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7650221Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:35.7656570Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7766549Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:35.7773032Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.7975224Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:35.7981629Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.8178793Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:35.8185269Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.8709019Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:35.8715651Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:35.9183626Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:35.9194268Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.0569262Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:36.0582024Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.1462756Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:36.1474812Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:36.2329232Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c 
/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:36.2347387Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.2464124Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:36.2473654Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.3245379Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:36.3257748Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.4116663Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:36.4129204Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.4696397Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:36.4707367Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.5748303Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:36.5764640Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:36.8001926Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:36.8012126Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.4054311Z [63/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:37.6297576Z [64/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:37.6315895Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.1053446Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:38.1072000Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:50:41.4578176Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:41.4596341Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.8853029Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:41.8870447Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.4891301Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG 
-std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:43.4908507Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.6262500Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:43.6281073Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.7751466Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:43.7769185Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.7905697Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:43.7923780Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.8057256Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:43.8075151Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.6092631Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:44.6111030Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:45.0904503Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:45.0922428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.0052604Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:46.0072889Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.0453089Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:47.0473562Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.4193640Z [77/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:47.4211396Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.6192961Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:48.9704570Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:50.6865838Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.6890748Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:50.6916123Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.9122581Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma 
-fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:50.9141580Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:53.2913783Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:53.2933237Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:54.7116741Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:54.7127368Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:55.1629023Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:55.1647414Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:55.5682407Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:55.5700696Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:58.5783821Z [86/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:58.5801961Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:59.2318232Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD 
-MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:59.2336828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:00.8214607Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:00.8223977Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:05.2701802Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:05.2721874Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:05.9547476Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:05.9568284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:07.7480289Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:07.7499151Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:08.4406022Z [92/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:08.4426679Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:09.7259643Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:09.7281035Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:10.1427628Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:10.1447137Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:14.2147646Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:14.2163455Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:14.9940830Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:14.9961289Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.9676467Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:16.9696190Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:17.5745799Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:17.5762348Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:18.7939366Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:18.7958766Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:19.1811427Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:19.1831397Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:23.2796255Z [101/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:23.2813424Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:32.2896110Z [102/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:32.2916276Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:32.4467654Z [103/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:32.4485514Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:37.3882102Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:37.3904068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3906701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3908022Z ^ 2025-05-07T19:51:37.3908266Z 2025-05-07T19:51:37.3908730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.3909398Z 2025-05-07T19:51:37.3911146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3913846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3915151Z ^ 2025-05-07T19:51:37.3915499Z 2025-05-07T19:51:37.3917076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3919668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3920838Z ^ 2025-05-07T19:51:37.3921082Z 2025-05-07T19:51:37.3921537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.3922180Z 2025-05-07T19:51:37.3923705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3926245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3927362Z ^ 2025-05-07T19:51:37.3927741Z 2025-05-07T19:51:37.3929542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3931848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3932866Z ^ 2025-05-07T19:51:37.3933244Z 2025-05-07T19:51:37.3933659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.3934292Z 2025-05-07T19:51:37.3935786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3938564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3939661Z ^ 2025-05-07T19:51:37.3940003Z 2025-05-07T19:51:37.3942120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3944914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3946261Z ^ 2025-05-07T19:51:37.3946505Z 2025-05-07T19:51:37.3946943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.3947592Z 2025-05-07T19:51:37.3949215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3951494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3952555Z ^ 
2025-05-07T19:51:37.3952947Z 2025-05-07T19:51:37.3954559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3957049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3958201Z ^ 2025-05-07T19:51:37.3958483Z 2025-05-07T19:51:37.3958883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:37.3959493Z 2025-05-07T19:51:37.3961030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:37.3963884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:37.3965029Z ^ 2025-05-07T19:51:37.3965369Z 2025-05-07T19:51:38.2763641Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:38.2780112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:51:38.2782043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2782869Z ^ 2025-05-07T19:51:38.2783049Z 2025-05-07T19:51:38.2783392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2783857Z 2025-05-07T19:51:38.2784949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2787025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2787915Z ^ 2025-05-07T19:51:38.2788213Z 2025-05-07T19:51:38.2789552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2791625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2792531Z ^ 2025-05-07T19:51:38.2792773Z 2025-05-07T19:51:38.2793107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2793584Z 2025-05-07T19:51:38.2794903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2796722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2797586Z ^ 2025-05-07T19:51:38.2797847Z 2025-05-07T19:51:38.2799097Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2801241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2802262Z ^ 2025-05-07T19:51:38.2802450Z 2025-05-07T19:51:38.2802757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2803276Z 2025-05-07T19:51:38.2804443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2806784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2807718Z ^ 2025-05-07T19:51:38.2807998Z 2025-05-07T19:51:38.2809197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2811415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2812419Z ^ 2025-05-07T19:51:38.2812644Z 2025-05-07T19:51:38.2813014Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2813616Z 2025-05-07T19:51:38.2814918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2816825Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2817699Z ^ 2025-05-07T19:51:38.2818025Z 2025-05-07T19:51:38.2819318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2821455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2822425Z ^ 2025-05-07T19:51:38.2822643Z 2025-05-07T19:51:38.2823038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2823497Z 2025-05-07T19:51:38.2824626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2826573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2827463Z ^ 2025-05-07T19:51:38.2827724Z 2025-05-07T19:51:38.2946559Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:38.2965287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:51:38.2967516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2968519Z ^ 2025-05-07T19:51:38.2968754Z 2025-05-07T19:51:38.2969120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2969692Z 2025-05-07T19:51:38.2971069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2973288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2974258Z ^ 2025-05-07T19:51:38.2974576Z 2025-05-07T19:51:38.2976023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2978220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2979189Z ^ 2025-05-07T19:51:38.2979418Z 2025-05-07T19:51:38.2979781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2980334Z 2025-05-07T19:51:38.2981728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2983927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2984926Z ^ 2025-05-07T19:51:38.2985223Z 2025-05-07T19:51:38.2986572Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2988920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2990059Z ^ 2025-05-07T19:51:38.2990295Z 2025-05-07T19:51:38.2990707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.2991359Z 2025-05-07T19:51:38.2992919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.2995789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.2996910Z ^ 2025-05-07T19:51:38.2997270Z 2025-05-07T19:51:38.2998784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3001509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3002602Z ^ 2025-05-07T19:51:38.3002835Z 2025-05-07T19:51:38.3003273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3003889Z 2025-05-07T19:51:38.3005669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3008264Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3009439Z ^ 2025-05-07T19:51:38.3009785Z 2025-05-07T19:51:38.3011365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3013921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3015066Z ^ 2025-05-07T19:51:38.3015307Z 2025-05-07T19:51:38.3015752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3016424Z 2025-05-07T19:51:38.3018011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3020698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3021797Z ^ 2025-05-07T19:51:38.3022134Z 2025-05-07T19:51:38.3548882Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:38.3571552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3574453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3575633Z ^ 2025-05-07T19:51:38.3575907Z 2025-05-07T19:51:38.3576432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3577090Z 2025-05-07T19:51:38.3578747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3581461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3582684Z ^ 2025-05-07T19:51:38.3583052Z 2025-05-07T19:51:38.3584624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3587299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3588471Z ^ 2025-05-07T19:51:38.3588765Z 2025-05-07T19:51:38.3589222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3589901Z 2025-05-07T19:51:38.3591559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3594365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3595579Z ^ 
2025-05-07T19:51:38.3595957Z 2025-05-07T19:51:38.3597618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3600196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3601390Z ^ 2025-05-07T19:51:38.3601649Z 2025-05-07T19:51:38.3602097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3602792Z 2025-05-07T19:51:38.3604758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3607427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3608796Z ^ 2025-05-07T19:51:38.3609182Z 2025-05-07T19:51:38.3610798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3613603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3614764Z ^ 2025-05-07T19:51:38.3615062Z 2025-05-07T19:51:38.3615527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3616191Z 2025-05-07T19:51:38.3617888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:51:38.3620526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3621733Z ^ 2025-05-07T19:51:38.3622097Z 2025-05-07T19:51:38.3623731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3626392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3627581Z ^ 2025-05-07T19:51:38.3627808Z 2025-05-07T19:51:38.3628265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.3629228Z 2025-05-07T19:51:38.3630898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.3633591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.3634863Z ^ 2025-05-07T19:51:38.3635257Z 2025-05-07T19:51:38.8166204Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:38.8187290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored 
on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8189916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8190986Z ^ 2025-05-07T19:51:38.8191237Z 2025-05-07T19:51:38.8191679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.8192337Z 2025-05-07T19:51:38.8193938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8212626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8213947Z ^ 2025-05-07T19:51:38.8214343Z 2025-05-07T19:51:38.8216017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8218426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8219514Z ^ 2025-05-07T19:51:38.8219767Z 2025-05-07T19:51:38.8220234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.8220868Z 2025-05-07T19:51:38.8222454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8224963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8226093Z ^ 
2025-05-07T19:51:38.8226445Z 2025-05-07T19:51:38.8227985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8230826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8231930Z ^ 2025-05-07T19:51:38.8232212Z 2025-05-07T19:51:38.8232957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.8233562Z 2025-05-07T19:51:38.8235249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8238087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8239235Z ^ 2025-05-07T19:51:38.8239594Z 2025-05-07T19:51:38.8241201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8243562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8244660Z ^ 2025-05-07T19:51:38.8244848Z 2025-05-07T19:51:38.8245232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.8245863Z 2025-05-07T19:51:38.8247282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:51:38.8249798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8250870Z ^ 2025-05-07T19:51:38.8251249Z 2025-05-07T19:51:38.8252731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8255188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8256283Z ^ 2025-05-07T19:51:38.8256556Z 2025-05-07T19:51:38.8256985Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:38.8257620Z 2025-05-07T19:51:38.8259140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:38.8261593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:38.8262755Z ^ 2025-05-07T19:51:38.8263083Z 2025-05-07T19:51:39.1500356Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:39.1523566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1526278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1527464Z ^ 2025-05-07T19:51:39.1527749Z 2025-05-07T19:51:39.1528202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.1529101Z 2025-05-07T19:51:39.1530808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1533498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1534729Z ^ 2025-05-07T19:51:39.1535094Z 2025-05-07T19:51:39.1536745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1539413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1540590Z ^ 2025-05-07T19:51:39.1540855Z 2025-05-07T19:51:39.1541312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.1542020Z 2025-05-07T19:51:39.1543713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1546659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:39.1547854Z ^ 2025-05-07T19:51:39.1548240Z 2025-05-07T19:51:39.1549866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1552307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1553301Z ^ 2025-05-07T19:51:39.1553555Z 2025-05-07T19:51:39.1553937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.1554902Z 2025-05-07T19:51:39.1556336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1558644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1559762Z ^ 2025-05-07T19:51:39.1560085Z 2025-05-07T19:51:39.1561587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1564021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1565134Z ^ 2025-05-07T19:51:39.1565382Z 2025-05-07T19:51:39.1565801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.1566442Z 2025-05-07T19:51:39.1567938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:39.1570427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1571527Z ^ 2025-05-07T19:51:39.1571888Z 2025-05-07T19:51:39.1573391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1575856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1576926Z ^ 2025-05-07T19:51:39.1577153Z 2025-05-07T19:51:39.1577584Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.1578201Z 2025-05-07T19:51:39.1579765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.1582255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.1583407Z ^ 2025-05-07T19:51:39.1583751Z 2025-05-07T19:51:39.5716287Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:39.5736252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored 
on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5738751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5739889Z ^ 2025-05-07T19:51:39.5740132Z 2025-05-07T19:51:39.5740483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5740952Z 2025-05-07T19:51:39.5742311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5744501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5745608Z ^ 2025-05-07T19:51:39.5745961Z 2025-05-07T19:51:39.5747513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5749778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5750692Z ^ 2025-05-07T19:51:39.5750921Z 2025-05-07T19:51:39.5751312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5751944Z 2025-05-07T19:51:39.5753549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5756105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5757188Z ^ 
2025-05-07T19:51:39.5757542Z 2025-05-07T19:51:39.5759384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5761995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5763321Z ^ 2025-05-07T19:51:39.5763589Z 2025-05-07T19:51:39.5763989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5764564Z 2025-05-07T19:51:39.5766088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5768454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5769559Z ^ 2025-05-07T19:51:39.5769915Z 2025-05-07T19:51:39.5771353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5773699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5774834Z ^ 2025-05-07T19:51:39.5775075Z 2025-05-07T19:51:39.5775496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5776146Z 2025-05-07T19:51:39.5777591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:51:39.5780078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5781076Z ^ 2025-05-07T19:51:39.5781404Z 2025-05-07T19:51:39.5782846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5785268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5786313Z ^ 2025-05-07T19:51:39.5786545Z 2025-05-07T19:51:39.5786998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5787571Z 2025-05-07T19:51:39.5789025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5791403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5792514Z ^ 2025-05-07T19:51:39.5792848Z 2025-05-07T19:51:39.6992025Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:39.7012577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7015010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7016194Z ^ 2025-05-07T19:51:39.7016424Z 2025-05-07T19:51:39.7017000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.7017579Z 2025-05-07T19:51:39.7019040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7021463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7022577Z ^ 2025-05-07T19:51:39.7022890Z 2025-05-07T19:51:39.7024225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7026692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7027777Z ^ 2025-05-07T19:51:39.7028015Z 2025-05-07T19:51:39.7028738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.7029391Z 2025-05-07T19:51:39.7030865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7033350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:39.7034896Z ^ 2025-05-07T19:51:39.7035280Z 2025-05-07T19:51:39.7036714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7039129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7039943Z ^ 2025-05-07T19:51:39.7040164Z 2025-05-07T19:51:39.7040540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.7041100Z 2025-05-07T19:51:39.7042556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7044976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7046061Z ^ 2025-05-07T19:51:39.7046418Z 2025-05-07T19:51:39.7047947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7050339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7051471Z ^ 2025-05-07T19:51:39.7051719Z 2025-05-07T19:51:39.7052140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.7052806Z 2025-05-07T19:51:39.7054372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:39.7056886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7058016Z ^ 2025-05-07T19:51:39.7058423Z 2025-05-07T19:51:39.7059933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7062387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7063418Z ^ 2025-05-07T19:51:39.7063671Z 2025-05-07T19:51:39.7064201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.7064832Z 2025-05-07T19:51:39.7066155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.7068832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.7069947Z ^ 2025-05-07T19:51:39.7070228Z 2025-05-07T19:51:40.3954593Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:45.5504309Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:45.5524248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5527101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5528074Z ^ 2025-05-07T19:51:45.5528701Z 2025-05-07T19:51:45.5529102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.5529691Z 2025-05-07T19:51:45.5531190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5533603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5534658Z ^ 2025-05-07T19:51:45.5535011Z 2025-05-07T19:51:45.5536454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5538836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5539928Z ^ 2025-05-07T19:51:45.5540157Z 2025-05-07T19:51:45.5540554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.5541163Z 2025-05-07T19:51:45.5542645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5545054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5546103Z ^ 2025-05-07T19:51:45.5546429Z 2025-05-07T19:51:45.5547859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5550203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5551233Z ^ 2025-05-07T19:51:45.5551504Z 2025-05-07T19:51:45.5551910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.5552491Z 2025-05-07T19:51:45.5553926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5556655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5557906Z ^ 2025-05-07T19:51:45.5558232Z 2025-05-07T19:51:45.5559678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5562504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5563781Z ^ 2025-05-07T19:51:45.5564028Z 2025-05-07T19:51:45.5564453Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:45.5565038Z 2025-05-07T19:51:45.5566349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5568988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5570030Z ^ 2025-05-07T19:51:45.5570360Z 2025-05-07T19:51:45.5571790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5574235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5575312Z ^ 2025-05-07T19:51:45.5575579Z 2025-05-07T19:51:45.5575965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.5576560Z 2025-05-07T19:51:45.5578077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.5580538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.5581637Z ^ 2025-05-07T19:51:45.5581963Z 2025-05-07T19:51:51.7730834Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:51.7751249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7753711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7755094Z ^ 2025-05-07T19:51:51.7755352Z 2025-05-07T19:51:51.7755798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.7756418Z 2025-05-07T19:51:51.7757997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7760536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7761756Z ^ 2025-05-07T19:51:51.7762144Z 2025-05-07T19:51:51.7763766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7766054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7767122Z ^ 2025-05-07T19:51:51.7767372Z 2025-05-07T19:51:51.7767771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.7768358Z 2025-05-07T19:51:51.7769835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7772382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7773489Z ^ 2025-05-07T19:51:51.7773850Z 2025-05-07T19:51:51.7775362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7778088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7779179Z ^ 2025-05-07T19:51:51.7779396Z 2025-05-07T19:51:51.7779789Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.7780401Z 2025-05-07T19:51:51.7781858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7784271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7785355Z ^ 2025-05-07T19:51:51.7785699Z 2025-05-07T19:51:51.7787590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7790055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7791181Z ^ 2025-05-07T19:51:51.7791448Z 2025-05-07T19:51:51.7791929Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:51.7792928Z 2025-05-07T19:51:51.7794696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7797096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7798177Z ^ 2025-05-07T19:51:51.7798519Z 2025-05-07T19:51:51.7800012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7802554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7803651Z ^ 2025-05-07T19:51:51.7803904Z 2025-05-07T19:51:51.7804351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.7805134Z 2025-05-07T19:51:51.7806758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.7809454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.7810654Z ^ 2025-05-07T19:51:51.7811001Z 2025-05-07T19:51:52.2566485Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:52.2586480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2588938Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2590003Z ^ 2025-05-07T19:51:52.2590229Z 2025-05-07T19:51:52.2590762Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:52.2591381Z 2025-05-07T19:51:52.2592857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2595472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2596577Z ^ 2025-05-07T19:51:52.2596907Z 2025-05-07T19:51:52.2598409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2600840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2601910Z ^ 2025-05-07T19:51:52.2602152Z 2025-05-07T19:51:52.2602567Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:52.2603166Z 2025-05-07T19:51:52.2604618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2606983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2608052Z ^ 2025-05-07T19:51:52.2608393Z 2025-05-07T19:51:52.2609903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2612310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2613399Z ^ 2025-05-07T19:51:52.2613609Z 2025-05-07T19:51:52.2613996Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:52.2614609Z 2025-05-07T19:51:52.2616048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2618489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2619551Z ^ 2025-05-07T19:51:52.2619892Z 2025-05-07T19:51:52.2621629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2623975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2626224Z ^ 2025-05-07T19:51:52.2626476Z 2025-05-07T19:51:52.2626868Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:52.2627479Z 2025-05-07T19:51:52.2629235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2631678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2632809Z ^ 2025-05-07T19:51:52.2633155Z 2025-05-07T19:51:52.2634805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2637304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2638362Z ^ 2025-05-07T19:51:52.2638610Z 2025-05-07T19:51:52.2638961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:52.2639500Z 2025-05-07T19:51:52.2640912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:52.2643244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:52.2644233Z ^ 2025-05-07T19:51:52.2644570Z 2025-05-07T19:51:55.6828281Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x 
cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:55.6851873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6854296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6855368Z ^ 2025-05-07T19:51:55.6855589Z 2025-05-07T19:51:55.6855991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.6856594Z 2025-05-07T19:51:55.6858134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6860586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6861673Z ^ 2025-05-07T19:51:55.6862021Z 2025-05-07T19:51:55.6863332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6865608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6866720Z ^ 2025-05-07T19:51:55.6866963Z 2025-05-07T19:51:55.6867386Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.6868016Z 2025-05-07T19:51:55.6869643Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6872200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6873426Z ^ 2025-05-07T19:51:55.6873796Z 2025-05-07T19:51:55.6875619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6878350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6879530Z ^ 2025-05-07T19:51:55.6879777Z 2025-05-07T19:51:55.6880221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.6880911Z 2025-05-07T19:51:55.6882628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6885708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6886903Z ^ 2025-05-07T19:51:55.6887280Z 2025-05-07T19:51:55.6888967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6891802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6892971Z ^ 
2025-05-07T19:51:55.6893230Z 2025-05-07T19:51:55.6893676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.6894342Z 2025-05-07T19:51:55.6896068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6898780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6899957Z ^ 2025-05-07T19:51:55.6900321Z 2025-05-07T19:51:55.6901955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6904590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6905771Z ^ 2025-05-07T19:51:55.6906018Z 2025-05-07T19:51:55.6906458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:55.6907143Z 2025-05-07T19:51:55.6908856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:55.6911597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:55.6912798Z ^ 2025-05-07T19:51:55.6913176Z 2025-05-07T19:51:56.7033992Z [117/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:56.7051705Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:57.3719725Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:58.0359953Z [119/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:51:59.4671277Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:51:59.4692847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4695958Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4697091Z ^ 2025-05-07T19:51:59.4697349Z 2025-05-07T19:51:59.4697782Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.4698432Z 2025-05-07T19:51:59.4700040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4702689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4703857Z ^ 2025-05-07T19:51:59.4704214Z 2025-05-07T19:51:59.4705857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4708379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4709477Z ^ 2025-05-07T19:51:59.4709725Z 2025-05-07T19:51:59.4710154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.4710827Z 2025-05-07T19:51:59.4712465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4715231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4716380Z ^ 2025-05-07T19:51:59.4716750Z 2025-05-07T19:51:59.4718398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4721056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4722208Z ^ 2025-05-07T19:51:59.4722455Z 2025-05-07T19:51:59.4722908Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.4723569Z 2025-05-07T19:51:59.4725187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4727659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4729110Z ^ 2025-05-07T19:51:59.4729451Z 2025-05-07T19:51:59.4731073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4733500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4734595Z ^ 2025-05-07T19:51:59.4734833Z 2025-05-07T19:51:59.4735608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.4736278Z 2025-05-07T19:51:59.4737936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4740643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4741834Z ^ 
2025-05-07T19:51:59.4742202Z 2025-05-07T19:51:59.4743884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4746600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4747791Z ^ 2025-05-07T19:51:59.4748032Z 2025-05-07T19:51:59.4748469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:59.4749264Z 2025-05-07T19:51:59.4750790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:59.4753351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:59.4754648Z ^ 2025-05-07T19:51:59.4755022Z 2025-05-07T19:52:04.3371196Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:04.3394914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3397597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3398769Z ^ 2025-05-07T19:52:04.3399023Z 2025-05-07T19:52:04.3399485Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.3400181Z 2025-05-07T19:52:04.3401872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3404583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3405752Z ^ 2025-05-07T19:52:04.3406108Z 2025-05-07T19:52:04.3407763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3410409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3411576Z ^ 2025-05-07T19:52:04.3411826Z 2025-05-07T19:52:04.3412265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.3412929Z 2025-05-07T19:52:04.3414575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3417195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:52:04.3418388Z ^ 2025-05-07T19:52:04.3418746Z 2025-05-07T19:52:04.3420392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3422999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3424126Z ^ 2025-05-07T19:52:04.3424332Z 2025-05-07T19:52:04.3424711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.3425400Z 2025-05-07T19:52:04.3426937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3429822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3430954Z ^ 2025-05-07T19:52:04.3431328Z 2025-05-07T19:52:04.3433410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3436184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3437304Z ^ 2025-05-07T19:52:04.3437797Z 2025-05-07T19:52:04.3438237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.3438900Z 2025-05-07T19:52:04.3440576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:52:04.3443263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3444453Z ^ 2025-05-07T19:52:04.3444811Z 2025-05-07T19:52:04.3446479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3449131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3450314Z ^ 2025-05-07T19:52:04.3450557Z 2025-05-07T19:52:04.3450983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.3451659Z 2025-05-07T19:52:04.3453330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.3455987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.3457103Z ^ 2025-05-07T19:52:04.3457470Z 2025-05-07T19:52:05.1152199Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:05.1170604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:52:05.1172898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1173845Z ^ 2025-05-07T19:52:05.1174073Z 2025-05-07T19:52:05.1174473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1175026Z 2025-05-07T19:52:05.1176430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1178672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1179650Z ^ 2025-05-07T19:52:05.1179946Z 2025-05-07T19:52:05.1181310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1183528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1184494Z ^ 2025-05-07T19:52:05.1184700Z 2025-05-07T19:52:05.1185071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1185637Z 2025-05-07T19:52:05.1187045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1189280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1190340Z ^ 2025-05-07T19:52:05.1190669Z 2025-05-07T19:52:05.1192085Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1194448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1195407Z ^ 2025-05-07T19:52:05.1195634Z 2025-05-07T19:52:05.1196001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1196573Z 2025-05-07T19:52:05.1197973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1200176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1201153Z ^ 2025-05-07T19:52:05.1201445Z 2025-05-07T19:52:05.1205972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1208282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1209427Z ^ 2025-05-07T19:52:05.1209655Z 2025-05-07T19:52:05.1210016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1210588Z 2025-05-07T19:52:05.1211983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1214191Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1215189Z ^ 2025-05-07T19:52:05.1215502Z 2025-05-07T19:52:05.1216873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1219168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1220193Z ^ 2025-05-07T19:52:05.1220419Z 2025-05-07T19:52:05.1220772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1221316Z 2025-05-07T19:52:05.1222705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1224894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1225893Z ^ 2025-05-07T19:52:05.1226188Z 2025-05-07T19:52:05.1635656Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:05.1656084Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1658561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1659675Z ^ 2025-05-07T19:52:05.1659918Z 2025-05-07T19:52:05.1660409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1661009Z 2025-05-07T19:52:05.1662505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1665014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1666122Z ^ 2025-05-07T19:52:05.1666464Z 2025-05-07T19:52:05.1667947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1670360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1671330Z ^ 2025-05-07T19:52:05.1671570Z 2025-05-07T19:52:05.1671960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1672562Z 2025-05-07T19:52:05.1674259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1676709Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1677847Z ^ 2025-05-07T19:52:05.1678188Z 2025-05-07T19:52:05.1679738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1682122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1683184Z ^ 2025-05-07T19:52:05.1683428Z 2025-05-07T19:52:05.1683828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1684417Z 2025-05-07T19:52:05.1685956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1688611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1689618Z ^ 2025-05-07T19:52:05.1689959Z 2025-05-07T19:52:05.1691473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1694014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1695053Z ^ 2025-05-07T19:52:05.1695290Z 2025-05-07T19:52:05.1695676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1696257Z 2025-05-07T19:52:05.1697832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1700253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1701322Z ^ 2025-05-07T19:52:05.1701646Z 2025-05-07T19:52:05.1703130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1705581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1706606Z ^ 2025-05-07T19:52:05.1706827Z 2025-05-07T19:52:05.1707227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.1707864Z 2025-05-07T19:52:05.1709399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.1711793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.1712699Z ^ 2025-05-07T19:52:05.1713000Z 2025-05-07T19:52:05.9318604Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:05.9342024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9344707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9345722Z ^ 2025-05-07T19:52:05.9346014Z 2025-05-07T19:52:05.9346440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.9347097Z 2025-05-07T19:52:05.9348573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9351072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9352145Z ^ 2025-05-07T19:52:05.9352484Z 2025-05-07T19:52:05.9354142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9356630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9357719Z ^ 2025-05-07T19:52:05.9357971Z 2025-05-07T19:52:05.9358398Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.9359066Z 2025-05-07T19:52:05.9360719Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9363355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9364544Z ^ 2025-05-07T19:52:05.9364905Z 2025-05-07T19:52:05.9366548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9369053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9370184Z ^ 2025-05-07T19:52:05.9370423Z 2025-05-07T19:52:05.9371196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.9371889Z 2025-05-07T19:52:05.9373427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9376165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9377269Z ^ 2025-05-07T19:52:05.9377633Z 2025-05-07T19:52:05.9379188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9381727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9382871Z ^ 
2025-05-07T19:52:05.9383106Z 2025-05-07T19:52:05.9383558Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.9384200Z 2025-05-07T19:52:05.9385833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9388410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9389558Z ^ 2025-05-07T19:52:05.9389897Z 2025-05-07T19:52:05.9391494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9394269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9395429Z ^ 2025-05-07T19:52:05.9395669Z 2025-05-07T19:52:05.9396106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:05.9396775Z 2025-05-07T19:52:05.9398361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:05.9400958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:05.9402158Z ^ 2025-05-07T19:52:05.9402516Z 2025-05-07T19:52:06.6763412Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:06.6785138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6787924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6789098Z ^ 2025-05-07T19:52:06.6789361Z 2025-05-07T19:52:06.6789808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.6790481Z 2025-05-07T19:52:06.6792205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6795033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6796241Z ^ 2025-05-07T19:52:06.6796605Z 2025-05-07T19:52:06.6798291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6800980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6802084Z ^ 2025-05-07T19:52:06.6802332Z 2025-05-07T19:52:06.6802770Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.6803442Z 2025-05-07T19:52:06.6805072Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6807765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6808926Z ^ 2025-05-07T19:52:06.6809303Z 2025-05-07T19:52:06.6810957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6813535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6814912Z ^ 2025-05-07T19:52:06.6815182Z 2025-05-07T19:52:06.6815620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.6816272Z 2025-05-07T19:52:06.6817871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6820456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6821391Z ^ 2025-05-07T19:52:06.6821703Z 2025-05-07T19:52:06.6823180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6825812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6826927Z ^ 
2025-05-07T19:52:06.6827213Z 2025-05-07T19:52:06.6827647Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.6828320Z 2025-05-07T19:52:06.6830196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6832830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6834120Z ^ 2025-05-07T19:52:06.6834489Z 2025-05-07T19:52:06.6836164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6838865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6840055Z ^ 2025-05-07T19:52:06.6840292Z 2025-05-07T19:52:06.6840743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.6841421Z 2025-05-07T19:52:06.6843099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.6845377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.6846436Z ^ 2025-05-07T19:52:06.6846823Z 2025-05-07T19:52:37.5690269Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:37.5712687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5715623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5716804Z ^ 2025-05-07T19:52:37.5717087Z 2025-05-07T19:52:37.5717538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.5718202Z 2025-05-07T19:52:37.5719855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5722541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5723759Z ^ 2025-05-07T19:52:37.5724118Z 2025-05-07T19:52:37.5725731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5728625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5729765Z ^ 
2025-05-07T19:52:37.5730016Z 2025-05-07T19:52:37.5730466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.5731157Z 2025-05-07T19:52:37.5732796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5735466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5736625Z ^ 2025-05-07T19:52:37.5736990Z 2025-05-07T19:52:37.5739028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5741666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5742827Z ^ 2025-05-07T19:52:37.5743314Z 2025-05-07T19:52:37.5743784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.5744438Z 2025-05-07T19:52:37.5746244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5748896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5750086Z ^ 2025-05-07T19:52:37.5750454Z 2025-05-07T19:52:37.5752098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:37.5754747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5755718Z ^ 2025-05-07T19:52:37.5755957Z 2025-05-07T19:52:37.5756344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.5756961Z 2025-05-07T19:52:37.5758567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5761413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5762652Z ^ 2025-05-07T19:52:37.5763035Z 2025-05-07T19:52:37.5764486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5767167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5768358Z ^ 2025-05-07T19:52:37.5768620Z 2025-05-07T19:52:37.5769114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:37.5769793Z 2025-05-07T19:52:37.5771497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:37.5774202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:37.5775565Z ^ 2025-05-07T19:52:37.5775947Z 2025-05-07T19:52:38.1506770Z [127/608] : && 
/github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:39.0610422Z [128/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:39.0632336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0635467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0636546Z ^ 2025-05-07T19:52:39.0636823Z 2025-05-07T19:52:39.0637290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.0638010Z 2025-05-07T19:52:39.0639700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0642119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0643018Z ^ 
2025-05-07T19:52:39.0643315Z 2025-05-07T19:52:39.0644584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0647085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0648205Z ^ 2025-05-07T19:52:39.0648472Z 2025-05-07T19:52:39.0648925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.0649597Z 2025-05-07T19:52:39.0651303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0654101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0655331Z ^ 2025-05-07T19:52:39.0655683Z 2025-05-07T19:52:39.0657341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0660037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0661197Z ^ 2025-05-07T19:52:39.0661464Z 2025-05-07T19:52:39.0661904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.0662620Z 2025-05-07T19:52:39.0664310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:39.0666977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0668161Z ^ 2025-05-07T19:52:39.0668542Z 2025-05-07T19:52:39.0670177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0672808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0674124Z ^ 2025-05-07T19:52:39.0674821Z 2025-05-07T19:52:39.0675268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.0675913Z 2025-05-07T19:52:39.0677567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0680364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0681554Z ^ 2025-05-07T19:52:39.0681904Z 2025-05-07T19:52:39.0683493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0686125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0687293Z ^ 2025-05-07T19:52:39.0687550Z 2025-05-07T19:52:39.0687988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.0688688Z 2025-05-07T19:52:39.0690359Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.0692988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.0693863Z ^ 2025-05-07T19:52:39.0694231Z 2025-05-07T19:52:39.2714850Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:39.2738545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2741182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2742380Z ^ 2025-05-07T19:52:39.2742633Z 2025-05-07T19:52:39.2743070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.2743652Z 2025-05-07T19:52:39.2745417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2748262Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2749483Z ^ 2025-05-07T19:52:39.2749874Z 2025-05-07T19:52:39.2751533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2754437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2755632Z ^ 2025-05-07T19:52:39.2755934Z 2025-05-07T19:52:39.2756387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.2756953Z 2025-05-07T19:52:39.2758390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2760923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2762053Z ^ 2025-05-07T19:52:39.2762394Z 2025-05-07T19:52:39.2764091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2766791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2767953Z ^ 2025-05-07T19:52:39.2768453Z 2025-05-07T19:52:39.2768870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.2769463Z 2025-05-07T19:52:39.2770948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2773736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2774969Z ^ 2025-05-07T19:52:39.2775321Z 2025-05-07T19:52:39.2777219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2779689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2780728Z ^ 2025-05-07T19:52:39.2783821Z 2025-05-07T19:52:39.2784314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.2784988Z 2025-05-07T19:52:39.2786715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2789399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2790547Z ^ 2025-05-07T19:52:39.2790826Z 2025-05-07T19:52:39.2792246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2795091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2796292Z ^ 2025-05-07T19:52:39.2796585Z 2025-05-07T19:52:39.2797090Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:52:39.2797678Z 2025-05-07T19:52:39.2799132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.2801438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.2802625Z ^ 2025-05-07T19:52:39.2802987Z 2025-05-07T19:52:39.7500912Z [130/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 
2025-05-07T19:52:47.0473158Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:47.0494982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0497673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0498876Z ^ 2025-05-07T19:52:47.0499435Z 2025-05-07T19:52:47.0499855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.0500496Z 2025-05-07T19:52:47.0502006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0504295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0505369Z ^ 2025-05-07T19:52:47.0505759Z 2025-05-07T19:52:47.0507328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0509870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0510995Z ^ 2025-05-07T19:52:47.0511259Z 
2025-05-07T19:52:47.0511757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.0512377Z 2025-05-07T19:52:47.0514151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0517003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0518166Z ^ 2025-05-07T19:52:47.0518519Z 2025-05-07T19:52:47.0520119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0522696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0523834Z ^ 2025-05-07T19:52:47.0524081Z 2025-05-07T19:52:47.0524506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.0525172Z 2025-05-07T19:52:47.0526886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0529753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0530877Z ^ 2025-05-07T19:52:47.0531239Z 2025-05-07T19:52:47.0532762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0535239Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0536340Z ^ 2025-05-07T19:52:47.0536600Z 2025-05-07T19:52:47.0537017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.0537625Z 2025-05-07T19:52:47.0539572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0542142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0543521Z ^ 2025-05-07T19:52:47.0543873Z 2025-05-07T19:52:47.0545401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0547925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0549201Z ^ 2025-05-07T19:52:47.0549448Z 2025-05-07T19:52:47.0549879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:47.0550567Z 2025-05-07T19:52:47.0552141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:47.0554973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:47.0556442Z ^ 2025-05-07T19:52:47.0556802Z 2025-05-07T19:52:52.1588465Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:52.1611669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1614928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1616127Z ^ 2025-05-07T19:52:52.1616397Z 2025-05-07T19:52:52.1616829Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.1617480Z 2025-05-07T19:52:52.1619154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1621961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1623133Z ^ 2025-05-07T19:52:52.1623518Z 2025-05-07T19:52:52.1625060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1627675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1629110Z ^ 2025-05-07T19:52:52.1629359Z 2025-05-07T19:52:52.1629824Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.1630469Z 2025-05-07T19:52:52.1632147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1634785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1635990Z ^ 2025-05-07T19:52:52.1636330Z 2025-05-07T19:52:52.1637984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1640644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1641733Z ^ 2025-05-07T19:52:52.1641985Z 2025-05-07T19:52:52.1642416Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.1643072Z 2025-05-07T19:52:52.1644729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1647353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1648542Z ^ 2025-05-07T19:52:52.1648906Z 2025-05-07T19:52:52.1650468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1653105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1654286Z ^ 2025-05-07T19:52:52.1654953Z 2025-05-07T19:52:52.1655430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.1656077Z 2025-05-07T19:52:52.1657729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1660458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1661807Z ^ 2025-05-07T19:52:52.1662168Z 2025-05-07T19:52:52.1663839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1666570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1667751Z ^ 2025-05-07T19:52:52.1668024Z 2025-05-07T19:52:52.1668479Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:52.1669054Z 2025-05-07T19:52:52.1670753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:52.1673472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:52.1674796Z ^ 2025-05-07T19:52:52.1675170Z 2025-05-07T19:52:54.6128781Z 
[133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:54.6144411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6146225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6147008Z ^ 2025-05-07T19:52:54.6147215Z 2025-05-07T19:52:54.6147512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.6147954Z 2025-05-07T19:52:54.6149084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6150835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6151662Z ^ 2025-05-07T19:52:54.6151910Z 2025-05-07T19:52:54.6153013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6154874Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6155674Z ^ 2025-05-07T19:52:54.6155871Z 2025-05-07T19:52:54.6156177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.6156640Z 2025-05-07T19:52:54.6157716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6159496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6160279Z ^ 2025-05-07T19:52:54.6160552Z 2025-05-07T19:52:54.6161617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6163382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6164152Z ^ 2025-05-07T19:52:54.6164357Z 2025-05-07T19:52:54.6164658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.6165119Z 2025-05-07T19:52:54.6166327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6168299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6169197Z ^ 2025-05-07T19:52:54.6169478Z 2025-05-07T19:52:54.6171072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6173112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6174071Z ^ 2025-05-07T19:52:54.6177523Z 2025-05-07T19:52:54.6177891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.6178454Z 2025-05-07T19:52:54.6179702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6181845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6182807Z ^ 2025-05-07T19:52:54.6183124Z 2025-05-07T19:52:54.6184443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6186550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6187511Z ^ 2025-05-07T19:52:54.6187735Z 2025-05-07T19:52:54.6188131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.6188657Z 2025-05-07T19:52:54.6189970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.6192117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.6193078Z ^ 
2025-05-07T19:52:54.6193374Z 2025-05-07T19:52:58.3759898Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:58.3782334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3784915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3786053Z ^ 2025-05-07T19:52:58.3786298Z 2025-05-07T19:52:58.3786724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3787369Z 2025-05-07T19:52:58.3788974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3791543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3792674Z ^ 2025-05-07T19:52:58.3793025Z 2025-05-07T19:52:58.3794936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:58.3812635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3813819Z ^ 2025-05-07T19:52:58.3814117Z 2025-05-07T19:52:58.3814559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3815200Z 2025-05-07T19:52:58.3816800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3819324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3820439Z ^ 2025-05-07T19:52:58.3820797Z 2025-05-07T19:52:58.3822341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3824953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3826069Z ^ 2025-05-07T19:52:58.3826324Z 2025-05-07T19:52:58.3826729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3827327Z 2025-05-07T19:52:58.3829229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3831734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3833297Z ^ 2025-05-07T19:52:58.3833665Z 2025-05-07T19:52:58.3835332Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3838242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3839421Z ^ 2025-05-07T19:52:58.3839681Z 2025-05-07T19:52:58.3840159Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3840837Z 2025-05-07T19:52:58.3842470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3845093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3846209Z ^ 2025-05-07T19:52:58.3846601Z 2025-05-07T19:52:58.3848127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3850738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3851830Z ^ 2025-05-07T19:52:58.3852090Z 2025-05-07T19:52:58.3852505Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3853160Z 2025-05-07T19:52:58.3854644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3857174Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3858345Z ^ 2025-05-07T19:52:58.3858686Z 2025-05-07T19:52:59.1365889Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:59.1387631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1390223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1391359Z ^ 2025-05-07T19:52:59.1391637Z 2025-05-07T19:52:59.1392063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.1392696Z 2025-05-07T19:52:59.1394454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1397009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1398175Z ^ 2025-05-07T19:52:59.1398540Z 2025-05-07T19:52:59.1400080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1402622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1403746Z ^ 2025-05-07T19:52:59.1403998Z 2025-05-07T19:52:59.1404414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.1405145Z 2025-05-07T19:52:59.1406727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1409521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1410651Z ^ 2025-05-07T19:52:59.1411049Z 2025-05-07T19:52:59.1412632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1414940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1415895Z ^ 2025-05-07T19:52:59.1416152Z 2025-05-07T19:52:59.1416533Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.1417089Z 2025-05-07T19:52:59.1419012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1421598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1422900Z ^ 2025-05-07T19:52:59.1423256Z 2025-05-07T19:52:59.1424627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1427014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1428191Z ^ 2025-05-07T19:52:59.1428988Z 2025-05-07T19:52:59.1429442Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.1430103Z 2025-05-07T19:52:59.1431738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1434281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1435397Z ^ 2025-05-07T19:52:59.1435776Z 2025-05-07T19:52:59.1437376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1439992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1441100Z ^ 2025-05-07T19:52:59.1441412Z 2025-05-07T19:52:59.1441825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.1442390Z 2025-05-07T19:52:59.1443905Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.1446695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.1447865Z ^ 2025-05-07T19:52:59.1448222Z 2025-05-07T19:53:04.3605991Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:04.3629508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3632078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3633303Z ^ 2025-05-07T19:53:04.3633624Z 2025-05-07T19:53:04.3634219Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.3634931Z 2025-05-07T19:53:04.3636760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3639432Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3640458Z ^ 2025-05-07T19:53:04.3640785Z 2025-05-07T19:53:04.3641999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3644137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3645223Z ^ 2025-05-07T19:53:04.3645490Z 2025-05-07T19:53:04.3645922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.3646596Z 2025-05-07T19:53:04.3648174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3650970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3652191Z ^ 2025-05-07T19:53:04.3652604Z 2025-05-07T19:53:04.3654404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3657209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3658390Z ^ 2025-05-07T19:53:04.3658649Z 2025-05-07T19:53:04.3659609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.3660288Z 2025-05-07T19:53:04.3661968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3664878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3666095Z ^ 2025-05-07T19:53:04.3666471Z 2025-05-07T19:53:04.3668061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3670693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3671874Z ^ 2025-05-07T19:53:04.3672130Z 2025-05-07T19:53:04.3672580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.3673249Z 2025-05-07T19:53:04.3675306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3678100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3679312Z ^ 2025-05-07T19:53:04.3679680Z 2025-05-07T19:53:04.3681419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3684083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3685264Z ^ 2025-05-07T19:53:04.3685514Z 2025-05-07T19:53:04.3685997Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:04.3686654Z 2025-05-07T19:53:04.3688272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:04.3690951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:04.3692147Z ^ 2025-05-07T19:53:04.3692522Z 2025-05-07T19:53:11.6155069Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:11.6177823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6180673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6181806Z ^ 2025-05-07T19:53:11.6182065Z 2025-05-07T19:53:11.6182568Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.6183210Z 2025-05-07T19:53:11.6184835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:11.6187530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6188695Z ^ 2025-05-07T19:53:11.6189058Z 2025-05-07T19:53:11.6190646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6193073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6194389Z ^ 2025-05-07T19:53:11.6194630Z 2025-05-07T19:53:11.6195004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.6195638Z 2025-05-07T19:53:11.6197233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6199948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6201111Z ^ 2025-05-07T19:53:11.6201477Z 2025-05-07T19:53:11.6203409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6205902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6207238Z ^ 2025-05-07T19:53:11.6207530Z 2025-05-07T19:53:11.6208157Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.6208821Z 2025-05-07T19:53:11.6210633Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6213243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6214354Z ^ 2025-05-07T19:53:11.6214696Z 2025-05-07T19:53:11.6216439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6219063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6220232Z ^ 2025-05-07T19:53:11.6220502Z 2025-05-07T19:53:11.6220971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.6221664Z 2025-05-07T19:53:11.6223373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6226150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6227228Z ^ 2025-05-07T19:53:11.6227640Z 2025-05-07T19:53:11.6229610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6232177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6233306Z ^ 
2025-05-07T19:53:11.6233567Z 2025-05-07T19:53:11.6234148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.6234791Z 2025-05-07T19:53:11.6236590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:11.6239181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:11.6240389Z ^ 2025-05-07T19:53:11.6240746Z 2025-05-07T19:53:12.4129944Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:12.4152253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4154905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4155999Z ^ 2025-05-07T19:53:12.4156240Z 2025-05-07T19:53:12.4156666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.4157290Z 
2025-05-07T19:53:12.4158800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4161227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4162256Z ^ 2025-05-07T19:53:12.4162643Z 2025-05-07T19:53:12.4164128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4166037Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4166551Z ^ 2025-05-07T19:53:12.4166827Z 2025-05-07T19:53:12.4168326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4170164Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4170893Z ^ 2025-05-07T19:53:12.4171165Z 2025-05-07T19:53:12.4173021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4174928Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4175437Z ^ 2025-05-07T19:53:12.4175725Z 2025-05-07T19:53:12.4177422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4179901Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4181022Z ^ 2025-05-07T19:53:12.4181262Z 2025-05-07T19:53:12.4181688Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.4182330Z 2025-05-07T19:53:12.4183927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4186403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4187518Z ^ 2025-05-07T19:53:12.4187883Z 2025-05-07T19:53:12.4189423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4191351Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4191948Z ^ 2025-05-07T19:53:12.4192221Z 2025-05-07T19:53:12.4193785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4195809Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4196385Z ^ 2025-05-07T19:53:12.4196722Z 2025-05-07T19:53:12.4198191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4200045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4200586Z ^ 2025-05-07T19:53:12.4200851Z 2025-05-07T19:53:12.4202397Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4204873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4205959Z ^ 2025-05-07T19:53:12.4206208Z 2025-05-07T19:53:12.4206660Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.4207315Z 2025-05-07T19:53:12.4208907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4211478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4212812Z ^ 2025-05-07T19:53:12.4213153Z 2025-05-07T19:53:12.4214822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4216704Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4217242Z ^ 2025-05-07T19:53:12.4217566Z 2025-05-07T19:53:12.4218999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4220993Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4221516Z ^ 2025-05-07T19:53:12.4221805Z 2025-05-07T19:53:12.4223230Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4225041Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4225583Z ^ 2025-05-07T19:53:12.4225865Z 2025-05-07T19:53:12.4227331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4229929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4231045Z ^ 2025-05-07T19:53:12.4231297Z 2025-05-07T19:53:12.4231756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.4232376Z 2025-05-07T19:53:12.4234027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4236786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4237911Z ^ 2025-05-07T19:53:12.4238294Z 2025-05-07T19:53:12.4239810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4241743Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4242264Z ^ 2025-05-07T19:53:12.4242576Z 2025-05-07T19:53:12.4243994Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4245877Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4246423Z ^ 2025-05-07T19:53:12.4246704Z 2025-05-07T19:53:12.4248198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4250061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4250565Z ^ 2025-05-07T19:53:12.4250849Z 2025-05-07T19:53:12.4252412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4255189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4256284Z ^ 2025-05-07T19:53:12.4256531Z 2025-05-07T19:53:12.4257373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.4258040Z 2025-05-07T19:53:12.4259566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4262273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.4263352Z ^ 2025-05-07T19:53:12.4263692Z 2025-05-07T19:53:12.4265176Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4266938Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4267483Z ^ 2025-05-07T19:53:12.4267796Z 2025-05-07T19:53:12.4269240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4271199Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4271912Z ^ 2025-05-07T19:53:12.4272205Z 2025-05-07T19:53:12.4273657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.4275620Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.4276192Z ^ 2025-05-07T19:53:12.4276495Z 2025-05-07T19:53:20.4879724Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:20.4900926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4907628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:20.4908888Z ^ 2025-05-07T19:53:20.4909182Z 2025-05-07T19:53:20.4909754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.4910607Z 2025-05-07T19:53:20.4912409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4915333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4916765Z ^ 2025-05-07T19:53:20.4917152Z 2025-05-07T19:53:20.4918848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4921551Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4922783Z ^ 2025-05-07T19:53:20.4923043Z 2025-05-07T19:53:20.4923515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.4924193Z 2025-05-07T19:53:20.4925922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4928908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4930120Z ^ 2025-05-07T19:53:20.4930509Z 2025-05-07T19:53:20.4932171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:20.4934867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4936043Z ^ 2025-05-07T19:53:20.4936331Z 2025-05-07T19:53:20.4936806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.4937655Z 2025-05-07T19:53:20.4939408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4942179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4943416Z ^ 2025-05-07T19:53:20.4943777Z 2025-05-07T19:53:20.4945445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4948260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4949575Z ^ 2025-05-07T19:53:20.4949837Z 2025-05-07T19:53:20.4950260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.4950938Z 2025-05-07T19:53:20.4952770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4955714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4956886Z ^ 2025-05-07T19:53:20.4957276Z 2025-05-07T19:53:20.4958927Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4961603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4962727Z ^ 2025-05-07T19:53:20.4963007Z 2025-05-07T19:53:20.4963451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.4964085Z 2025-05-07T19:53:20.4965623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4968321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.4969517Z ^ 2025-05-07T19:53:20.4969878Z 2025-05-07T19:53:20.6889413Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:20.6912633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:53:20.6915440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6916583Z ^ 2025-05-07T19:53:20.6916842Z 2025-05-07T19:53:20.6917263Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.6917870Z 2025-05-07T19:53:20.6919456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.6922068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6923228Z ^ 2025-05-07T19:53:20.6923575Z 2025-05-07T19:53:20.6925080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6927258Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:20.6928001Z ^ 2025-05-07T19:53:20.6928282Z 2025-05-07T19:53:20.6930220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6932046Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6932499Z ^ 2025-05-07T19:53:20.6932746Z 2025-05-07T19:53:20.6934158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6935855Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:53:20.6936353Z ^ 2025-05-07T19:53:20.6936583Z 2025-05-07T19:53:20.6937945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6939966Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6940538Z ^ 2025-05-07T19:53:20.6940829Z 2025-05-07T19:53:20.6942549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.6945330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6946521Z ^ 2025-05-07T19:53:20.6946802Z 2025-05-07T19:53:20.6947259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.6947937Z 2025-05-07T19:53:20.6950005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.6952502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6954063Z ^ 2025-05-07T19:53:20.6954371Z 2025-05-07T19:53:20.6955512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6957419Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:20.6958111Z ^ 2025-05-07T19:53:20.6958380Z 2025-05-07T19:53:20.6959844Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6961808Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6962408Z ^ 2025-05-07T19:53:20.6962703Z 2025-05-07T19:53:20.6964098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6966039Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6966568Z ^ 2025-05-07T19:53:20.6966826Z 2025-05-07T19:53:20.6968257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6970154Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6970701Z ^ 2025-05-07T19:53:20.6970997Z 2025-05-07T19:53:20.6972581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.6975224Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6976381Z ^ 2025-05-07T19:53:20.6976654Z 2025-05-07T19:53:20.6977088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.6977735Z 2025-05-07T19:53:20.6979327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:20.6981950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.6983023Z ^ 2025-05-07T19:53:20.6983406Z 2025-05-07T19:53:20.6984906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6986550Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:20.6987207Z ^ 2025-05-07T19:53:20.6987455Z 2025-05-07T19:53:20.6988830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6990890Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6991411Z ^ 2025-05-07T19:53:20.6991717Z 2025-05-07T19:53:20.6993170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6995602Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.6996156Z ^ 2025-05-07T19:53:20.6996447Z 2025-05-07T19:53:20.6997785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.6999525Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7000074Z ^ 2025-05-07T19:53:20.7000333Z 2025-05-07T19:53:20.7001940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:53:20.7004051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.7005120Z ^ 2025-05-07T19:53:20.7005338Z 2025-05-07T19:53:20.7005806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.7006452Z 2025-05-07T19:53:20.7008055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.7010702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.7011872Z ^ 2025-05-07T19:53:20.7012228Z 2025-05-07T19:53:20.7013734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7015838Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:20.7016567Z ^ 2025-05-07T19:53:20.7016866Z 2025-05-07T19:53:20.7018374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7020305Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7020861Z ^ 2025-05-07T19:53:20.7021159Z 2025-05-07T19:53:20.7022664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7024558Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:53:20.7025141Z ^ 2025-05-07T19:53:20.7025408Z 2025-05-07T19:53:20.7026904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7029097Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7029640Z ^ 2025-05-07T19:53:20.7029895Z 2025-05-07T19:53:20.7031820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.7034300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.7035334Z ^ 2025-05-07T19:53:20.7035558Z 2025-05-07T19:53:20.7036228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:20.7036853Z 2025-05-07T19:53:20.7038550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.7041148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:20.7042292Z ^ 2025-05-07T19:53:20.7042646Z 2025-05-07T19:53:20.7044093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7046226Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:20.7046978Z ^ 2025-05-07T19:53:20.7047267Z 2025-05-07T19:53:20.7048833Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7050751Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7051298Z ^ 2025-05-07T19:53:20.7051576Z 2025-05-07T19:53:20.7052998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7054895Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7055444Z ^ 2025-05-07T19:53:20.7055735Z 2025-05-07T19:53:20.7057195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:20.7059173Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:20.7059735Z ^ 2025-05-07T19:53:20.7060055Z 2025-05-07T19:53:21.0720240Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:21.0742281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0744575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0745626Z ^ 2025-05-07T19:53:21.0745877Z 2025-05-07T19:53:21.0746305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.0746960Z 2025-05-07T19:53:21.0748484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0750902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0751978Z ^ 2025-05-07T19:53:21.0752375Z 2025-05-07T19:53:21.0753692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0755648Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.0756296Z ^ 2025-05-07T19:53:21.0756759Z 2025-05-07T19:53:21.0758196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0759800Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0760310Z ^ 2025-05-07T19:53:21.0760600Z 2025-05-07T19:53:21.0761904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:21.0763608Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0764061Z ^ 2025-05-07T19:53:21.0764310Z 2025-05-07T19:53:21.0765676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0767320Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0767988Z ^ 2025-05-07T19:53:21.0768627Z 2025-05-07T19:53:21.0770095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0772381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0773828Z ^ 2025-05-07T19:53:21.0774019Z 2025-05-07T19:53:21.0774356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.0774936Z 2025-05-07T19:53:21.0776343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0778644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0779647Z ^ 2025-05-07T19:53:21.0779994Z 2025-05-07T19:53:21.0781206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0783023Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.0783699Z ^ 
2025-05-07T19:53:21.0783963Z 2025-05-07T19:53:21.0785270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0786930Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0787435Z ^ 2025-05-07T19:53:21.0787714Z 2025-05-07T19:53:21.0789117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0790860Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0791336Z ^ 2025-05-07T19:53:21.0791570Z 2025-05-07T19:53:21.0792916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0794746Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0795241Z ^ 2025-05-07T19:53:21.0795481Z 2025-05-07T19:53:21.0796893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0799258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0800304Z ^ 2025-05-07T19:53:21.0800558Z 2025-05-07T19:53:21.0800943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.0801550Z 2025-05-07T19:53:21.0803059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:53:21.0805507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0806512Z ^ 2025-05-07T19:53:21.0806836Z 2025-05-07T19:53:21.0808384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0810288Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.0811181Z ^ 2025-05-07T19:53:21.0811451Z 2025-05-07T19:53:21.0812868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0814593Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0815157Z ^ 2025-05-07T19:53:21.0815412Z 2025-05-07T19:53:21.0816730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0818147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0818592Z ^ 2025-05-07T19:53:21.0818822Z 2025-05-07T19:53:21.0820133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0821751Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0822233Z ^ 2025-05-07T19:53:21.0822466Z 2025-05-07T19:53:21.0823770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0825933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0826941Z ^ 2025-05-07T19:53:21.0827186Z 2025-05-07T19:53:21.0827575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.0828131Z 2025-05-07T19:53:21.0829863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0832241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0833347Z ^ 2025-05-07T19:53:21.0833660Z 2025-05-07T19:53:21.0835149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0837074Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.0837736Z ^ 2025-05-07T19:53:21.0837996Z 2025-05-07T19:53:21.0839391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0841180Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0841713Z ^ 2025-05-07T19:53:21.0841975Z 2025-05-07T19:53:21.0843353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0845056Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0845499Z ^ 2025-05-07T19:53:21.0846121Z 2025-05-07T19:53:21.0847327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0848864Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0849570Z ^ 2025-05-07T19:53:21.0849818Z 2025-05-07T19:53:21.0851184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0853391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0854464Z ^ 2025-05-07T19:53:21.0854688Z 2025-05-07T19:53:21.0855143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.0855742Z 2025-05-07T19:53:21.0857235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.0859697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.0860846Z ^ 2025-05-07T19:53:21.0861219Z 2025-05-07T19:53:21.0862709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0864774Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.0865478Z ^ 2025-05-07T19:53:21.0865730Z 
2025-05-07T19:53:21.0866955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0868746Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0869270Z ^ 2025-05-07T19:53:21.0869564Z 2025-05-07T19:53:21.0871052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0872810Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0873331Z ^ 2025-05-07T19:53:21.0873622Z 2025-05-07T19:53:21.0875099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.0876834Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.0877373Z ^ 2025-05-07T19:53:21.0877635Z 2025-05-07T19:53:21.1087940Z [142/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:21.7061954Z [143/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so 
-lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:23.1991756Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:23.2014059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2016523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2017652Z ^ 2025-05-07T19:53:23.2017894Z 2025-05-07T19:53:23.2018346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:23.2018974Z 2025-05-07T19:53:23.2020532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2023293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2024440Z ^ 2025-05-07T19:53:23.2024830Z 2025-05-07T19:53:23.2026398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:53:23.2029583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2030696Z ^ 2025-05-07T19:53:23.2030976Z 2025-05-07T19:53:23.2031515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:23.2032490Z 2025-05-07T19:53:23.2034190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2036536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2037553Z ^ 2025-05-07T19:53:23.2037899Z 2025-05-07T19:53:23.2039412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2041733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2042816Z ^ 2025-05-07T19:53:23.2043076Z 2025-05-07T19:53:23.2043498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:23.2044094Z 2025-05-07T19:53:23.2045643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2048184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2049313Z ^ 2025-05-07T19:53:23.2049684Z 2025-05-07T19:53:23.2051228Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2053677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2054814Z ^ 2025-05-07T19:53:23.2055065Z 2025-05-07T19:53:23.2055456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:23.2056051Z 2025-05-07T19:53:23.2057533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2059945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2061007Z ^ 2025-05-07T19:53:23.2061327Z 2025-05-07T19:53:23.2062853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2065280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2066522Z ^ 2025-05-07T19:53:23.2066751Z 2025-05-07T19:53:23.2067180Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:23.2067819Z 2025-05-07T19:53:23.2069634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.2072233Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:23.2073463Z ^ 2025-05-07T19:53:23.2074141Z 2025-05-07T19:53:25.5788166Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:25.5810347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5812975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5814150Z ^ 2025-05-07T19:53:25.5814417Z 2025-05-07T19:53:25.5814848Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.5815528Z 2025-05-07T19:53:25.5817093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5819688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5820646Z ^ 2025-05-07T19:53:25.5820947Z 2025-05-07T19:53:25.5822816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5825145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5828207Z ^ 2025-05-07T19:53:25.5828777Z 2025-05-07T19:53:25.5829201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.5829803Z 2025-05-07T19:53:25.5831307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5834010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5835121Z ^ 2025-05-07T19:53:25.5835464Z 2025-05-07T19:53:25.5836890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5839417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5840408Z ^ 2025-05-07T19:53:25.5840660Z 2025-05-07T19:53:25.5841030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.5841778Z 2025-05-07T19:53:25.5843277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5845674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5846746Z ^ 2025-05-07T19:53:25.5847089Z 2025-05-07T19:53:25.5848786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5851116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5852175Z ^ 2025-05-07T19:53:25.5852415Z 2025-05-07T19:53:25.5852826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.5853478Z 2025-05-07T19:53:25.5854890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5857294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5858392Z ^ 2025-05-07T19:53:25.5858735Z 2025-05-07T19:53:25.5860271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5862829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5863932Z ^ 2025-05-07T19:53:25.5864163Z 2025-05-07T19:53:25.5864968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.5865583Z 2025-05-07T19:53:25.5867053Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.5869905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.5871031Z ^ 2025-05-07T19:53:25.5871397Z 2025-05-07T19:53:25.6588980Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:25.6609727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6612302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6613380Z ^ 2025-05-07T19:53:25.6613637Z 2025-05-07T19:53:25.6614074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.6614718Z 2025-05-07T19:53:25.6616349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6619508Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6620630Z ^ 2025-05-07T19:53:25.6620972Z 2025-05-07T19:53:25.6622422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6625213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6626379Z ^ 2025-05-07T19:53:25.6626622Z 2025-05-07T19:53:25.6627049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.6627713Z 2025-05-07T19:53:25.6629730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6632238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6633331Z ^ 2025-05-07T19:53:25.6633635Z 2025-05-07T19:53:25.6635143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6637658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6638787Z ^ 2025-05-07T19:53:25.6639060Z 2025-05-07T19:53:25.6639594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.6640275Z 2025-05-07T19:53:25.6641903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6644562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6645659Z ^ 2025-05-07T19:53:25.6645994Z 2025-05-07T19:53:25.6647611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6650217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6651573Z ^ 2025-05-07T19:53:25.6651827Z 2025-05-07T19:53:25.6652248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.6652863Z 2025-05-07T19:53:25.6654387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6656796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6657908Z ^ 2025-05-07T19:53:25.6658294Z 2025-05-07T19:53:25.6659869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6662761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6663878Z ^ 2025-05-07T19:53:25.6664140Z 2025-05-07T19:53:25.6664546Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:25.6665388Z 2025-05-07T19:53:25.6666844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6669377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.6670716Z ^ 2025-05-07T19:53:25.6671080Z 2025-05-07T19:53:32.8921523Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:32.8943804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8946333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8947489Z ^ 2025-05-07T19:53:32.8947758Z 2025-05-07T19:53:32.8948210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.8949478Z 2025-05-07T19:53:32.8951066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:53:32.8953637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8954775Z ^ 2025-05-07T19:53:32.8955112Z 2025-05-07T19:53:32.8956541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8958771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8959879Z ^ 2025-05-07T19:53:32.8960128Z 2025-05-07T19:53:32.8960538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.8961039Z 2025-05-07T19:53:32.8962578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8965138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8966063Z ^ 2025-05-07T19:53:32.8966348Z 2025-05-07T19:53:32.8967714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8970092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8971084Z ^ 2025-05-07T19:53:32.8971324Z 2025-05-07T19:53:32.8971738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.8972375Z 2025-05-07T19:53:32.8973957Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8976492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8977669Z ^ 2025-05-07T19:53:32.8977994Z 2025-05-07T19:53:32.8979566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8982016Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8983021Z ^ 2025-05-07T19:53:32.8983245Z 2025-05-07T19:53:32.8983653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.8984283Z 2025-05-07T19:53:32.8985913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8988626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8989837Z ^ 2025-05-07T19:53:32.8990242Z 2025-05-07T19:53:32.8992102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.8994999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.8996297Z ^ 
2025-05-07T19:53:32.8996558Z 2025-05-07T19:53:32.8996984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.8997585Z 2025-05-07T19:53:32.8999135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9001813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9003068Z ^ 2025-05-07T19:53:32.9003452Z 2025-05-07T19:53:33.8582239Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:33.8605372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8608655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8609851Z ^ 2025-05-07T19:53:33.8610116Z 2025-05-07T19:53:33.8610707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.8611573Z 2025-05-07T19:53:33.8613310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8616033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8617263Z ^ 2025-05-07T19:53:33.8617630Z 2025-05-07T19:53:33.8619318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8622028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8623243Z ^ 2025-05-07T19:53:33.8623505Z 2025-05-07T19:53:33.8623950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.8624650Z 2025-05-07T19:53:33.8626324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8629297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8630511Z ^ 2025-05-07T19:53:33.8630919Z 2025-05-07T19:53:33.8632629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8635326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8636283Z ^ 2025-05-07T19:53:33.8636524Z 2025-05-07T19:53:33.8636962Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:33.8637553Z 2025-05-07T19:53:33.8638791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8641325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8642415Z ^ 2025-05-07T19:53:33.8642756Z 2025-05-07T19:53:33.8644081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8646087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8647032Z ^ 2025-05-07T19:53:33.8647269Z 2025-05-07T19:53:33.8647653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.8648233Z 2025-05-07T19:53:33.8649811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8652500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8653546Z ^ 2025-05-07T19:53:33.8653874Z 2025-05-07T19:53:33.8655419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8657760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8658768Z ^ 2025-05-07T19:53:33.8658975Z 2025-05-07T19:53:33.8659391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.8659977Z 2025-05-07T19:53:33.8661434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.8664070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.8665280Z ^ 2025-05-07T19:53:33.8665610Z 2025-05-07T19:53:45.6501137Z [149/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:53:45.6520112Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:49.1503115Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:53:49.1526576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1529606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1530827Z ^ 2025-05-07T19:53:49.1531091Z 2025-05-07T19:53:49.1531557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:49.1532197Z 2025-05-07T19:53:49.1533864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1536586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1537805Z ^ 2025-05-07T19:53:49.1538185Z 2025-05-07T19:53:49.1539908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1542424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1543556Z ^ 2025-05-07T19:53:49.1543818Z 2025-05-07T19:53:49.1544729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:49.1545372Z 2025-05-07T19:53:49.1546867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1549664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1550772Z ^ 
2025-05-07T19:53:49.1551130Z 2025-05-07T19:53:49.1552597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1555559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1556959Z ^ 2025-05-07T19:53:49.1557250Z 2025-05-07T19:53:49.1557718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:49.1558411Z 2025-05-07T19:53:49.1560080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1562612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1563867Z ^ 2025-05-07T19:53:49.1564392Z 2025-05-07T19:53:49.1565868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1568256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1569351Z ^ 2025-05-07T19:53:49.1569595Z 2025-05-07T19:53:49.1570024Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:49.1570674Z 2025-05-07T19:53:49.1572217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:49.1574761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1575880Z ^ 2025-05-07T19:53:49.1576255Z 2025-05-07T19:53:49.1577743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1580195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1581334Z ^ 2025-05-07T19:53:49.1581605Z 2025-05-07T19:53:49.1582042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:49.1582654Z 2025-05-07T19:53:49.1584148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.1586588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:49.1587706Z ^ 2025-05-07T19:53:49.1588339Z 2025-05-07T19:53:50.0334786Z [151/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:53:50.0352287Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:57.6558456Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:57.6582715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6585521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6586767Z ^ 2025-05-07T19:53:57.6587030Z 2025-05-07T19:53:57.6587487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:57.6588213Z 2025-05-07T19:53:57.6589953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6592778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6594119Z ^ 2025-05-07T19:53:57.6594542Z 2025-05-07T19:53:57.6596261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6599055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6600282Z ^ 2025-05-07T19:53:57.6600544Z 2025-05-07T19:53:57.6601035Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:57.6601732Z 2025-05-07T19:53:57.6603557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6606565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6607860Z ^ 2025-05-07T19:53:57.6608257Z 2025-05-07T19:53:57.6610024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6612906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6614189Z ^ 2025-05-07T19:53:57.6614459Z 2025-05-07T19:53:57.6615046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:57.6615768Z 2025-05-07T19:53:57.6617683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6620653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6621886Z ^ 2025-05-07T19:53:57.6622400Z 2025-05-07T19:53:57.6624116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6626891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6628109Z ^ 2025-05-07T19:53:57.6628634Z 2025-05-07T19:53:57.6629124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:57.6629821Z 2025-05-07T19:53:57.6631562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6634438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6635688Z ^ 2025-05-07T19:53:57.6636070Z 2025-05-07T19:53:57.6637785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6640533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6641760Z ^ 2025-05-07T19:53:57.6642020Z 2025-05-07T19:53:57.6642484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:57.6643167Z 2025-05-07T19:53:57.6644927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.6647701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:57.6648934Z ^ 2025-05-07T19:53:57.6649308Z 2025-05-07T19:53:57.6705805Z [153/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 
2025-05-07T19:53:57.6726373Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:01.5014851Z [154/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:01.5036507Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:03.9745289Z [155/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:03.9765768Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:08.1017660Z [156/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:08.1037012Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:13.0475681Z [157/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:13.0493524Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:13.6936811Z [158/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:13.6957504Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:14.3914380Z [159/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:14.3932062Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:15.9244646Z [160/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:15.9266491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9268999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9270131Z ^ 2025-05-07T19:54:15.9270385Z 
2025-05-07T19:54:15.9270892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:15.9271522Z 2025-05-07T19:54:15.9273054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9275674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9276756Z ^ 2025-05-07T19:54:15.9277157Z 2025-05-07T19:54:15.9278720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9281174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9282246Z ^ 2025-05-07T19:54:15.9282502Z 2025-05-07T19:54:15.9282897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:15.9283478Z 2025-05-07T19:54:15.9285096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9287609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9288742Z ^ 2025-05-07T19:54:15.9289092Z 2025-05-07T19:54:15.9290639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9293456Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9294627Z ^ 2025-05-07T19:54:15.9294880Z 2025-05-07T19:54:15.9295343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:15.9310538Z 2025-05-07T19:54:15.9312199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9314987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9316087Z ^ 2025-05-07T19:54:15.9316458Z 2025-05-07T19:54:15.9318011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9320631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9321593Z ^ 2025-05-07T19:54:15.9321851Z 2025-05-07T19:54:15.9322251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:15.9322832Z 2025-05-07T19:54:15.9324326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9326782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9327952Z ^ 2025-05-07T19:54:15.9328310Z 2025-05-07T19:54:15.9330099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9332594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9333668Z ^ 2025-05-07T19:54:15.9333910Z 2025-05-07T19:54:15.9334363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:15.9334978Z 2025-05-07T19:54:15.9336550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:15.9339180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:15.9340348Z ^ 2025-05-07T19:54:15.9340711Z 2025-05-07T19:54:16.2775205Z [161/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:16.2796524Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:17.9047460Z [162/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:17.9071147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9074472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9075623Z ^ 2025-05-07T19:54:17.9075907Z 2025-05-07T19:54:17.9076341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.9077004Z 2025-05-07T19:54:17.9078846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9081629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9082845Z ^ 2025-05-07T19:54:17.9083134Z 2025-05-07T19:54:17.9084526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9086966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9088011Z ^ 2025-05-07T19:54:17.9088255Z 2025-05-07T19:54:17.9088661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.9089279Z 2025-05-07T19:54:17.9090832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9093580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9094792Z ^ 2025-05-07T19:54:17.9095130Z 2025-05-07T19:54:17.9096881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9099749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9100912Z ^ 2025-05-07T19:54:17.9101158Z 2025-05-07T19:54:17.9101610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.9102268Z 2025-05-07T19:54:17.9103916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9106691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9107913Z ^ 2025-05-07T19:54:17.9108291Z 2025-05-07T19:54:17.9109873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9112497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9114105Z ^ 2025-05-07T19:54:17.9114388Z 2025-05-07T19:54:17.9114796Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:54:17.9115435Z 2025-05-07T19:54:17.9117088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9119845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9120934Z ^ 2025-05-07T19:54:17.9121260Z 2025-05-07T19:54:17.9122835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9125550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9126807Z ^ 2025-05-07T19:54:17.9127077Z 2025-05-07T19:54:17.9127583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.9128277Z 2025-05-07T19:54:17.9130091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.9132585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.9133648Z ^ 2025-05-07T19:54:17.9134023Z 2025-05-07T19:54:18.3714898Z [163/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:18.3735416Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.3850046Z 
[164/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:20.3872086Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.4159452Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.4180403Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.4471577Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.4492261Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.4773941Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.4784359Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.5093101Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.5103863Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.5409481Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.5420231Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.5727731Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.5738322Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.6045084Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.6055662Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.6455252Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.6465695Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.6778170Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.6788670Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.7097935Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.7111569Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.7417499Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.7430005Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.7736116Z 
[176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.7746750Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.8058371Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.8069022Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:24.3231536Z [178/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:24.3255871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3258525Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3259741Z ^ 2025-05-07T19:54:24.3260008Z 2025-05-07T19:54:24.3260502Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:24.3261164Z 2025-05-07T19:54:24.3262489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3264746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3265830Z ^ 2025-05-07T19:54:24.3266150Z 2025-05-07T19:54:24.3267676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3270214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3271293Z ^ 2025-05-07T19:54:24.3271571Z 2025-05-07T19:54:24.3271977Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:24.3272639Z 2025-05-07T19:54:24.3274426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3276966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3278056Z ^ 2025-05-07T19:54:24.3278381Z 2025-05-07T19:54:24.3280056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3282830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3284087Z ^ 2025-05-07T19:54:24.3284353Z 2025-05-07T19:54:24.3284824Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:24.3285576Z 2025-05-07T19:54:24.3287347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3290205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3291455Z ^ 2025-05-07T19:54:24.3291866Z 2025-05-07T19:54:24.3293892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3296638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3297802Z ^ 2025-05-07T19:54:24.3298217Z 2025-05-07T19:54:24.3298672Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:24.3299339Z 2025-05-07T19:54:24.3301105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3303894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:24.3305160Z ^ 2025-05-07T19:54:24.3305532Z 2025-05-07T19:54:24.3307262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3309900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3311044Z ^ 2025-05-07T19:54:24.3311274Z 2025-05-07T19:54:24.3311700Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:24.3312331Z 2025-05-07T19:54:24.3314047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.3316861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:24.3317905Z ^ 2025-05-07T19:54:24.3318263Z 2025-05-07T19:54:25.1415622Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:25.1436113Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:25.7823794Z [180/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 
2025-05-07T19:54:25.7843992Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.2424901Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:27.2445712Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.7136389Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:27.7158351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7161022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7162526Z ^ 2025-05-07T19:54:27.7162795Z 2025-05-07T19:54:27.7163256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.7163960Z 2025-05-07T19:54:27.7165644Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7168401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7169608Z ^ 2025-05-07T19:54:27.7170011Z 2025-05-07T19:54:27.7171710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7174440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7175635Z ^ 2025-05-07T19:54:27.7175918Z 2025-05-07T19:54:27.7176374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.7177051Z 2025-05-07T19:54:27.7178734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7181285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7182535Z ^ 2025-05-07T19:54:27.7182915Z 2025-05-07T19:54:27.7184462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7187152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7188338Z ^ 
2025-05-07T19:54:27.7188588Z 2025-05-07T19:54:27.7189042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.7189725Z 2025-05-07T19:54:27.7191430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7194181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7195302Z ^ 2025-05-07T19:54:27.7195699Z 2025-05-07T19:54:27.7197279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7199790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7200680Z ^ 2025-05-07T19:54:27.7200882Z 2025-05-07T19:54:27.7201362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.7202029Z 2025-05-07T19:54:27.7203908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7206560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7207894Z ^ 2025-05-07T19:54:27.7208243Z 2025-05-07T19:54:27.7209862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:27.7212508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7213689Z ^ 2025-05-07T19:54:27.7213943Z 2025-05-07T19:54:27.7214397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:27.7215058Z 2025-05-07T19:54:27.7216736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:27.7219552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:27.7220971Z ^ 2025-05-07T19:54:27.7221341Z 2025-05-07T19:54:28.2850599Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:28.2873574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2876858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.2878061Z ^ 2025-05-07T19:54:28.2878363Z 2025-05-07T19:54:28.2878832Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:54:28.2879543Z 2025-05-07T19:54:28.2881292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2883882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.2885066Z ^ 2025-05-07T19:54:28.2885428Z 2025-05-07T19:54:28.2886758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2888668Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2889688Z ^ 2025-05-07T19:54:28.2892847Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.2895860Z 2025-05-07T19:54:28.2897070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2898889Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2899788Z ^ 2025-05-07T19:54:28.2903341Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.2906822Z 2025-05-07T19:54:28.2908178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2910213Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2911149Z ^ 2025-05-07T19:54:28.2915196Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.2918356Z 2025-05-07T19:54:28.2919623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2921802Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2922737Z ^ 2025-05-07T19:54:28.2926279Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.2930053Z 2025-05-07T19:54:28.2931335Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2933344Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2934247Z ^ 2025-05-07T19:54:28.2937696Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.2940639Z 2025-05-07T19:54:28.2941821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2943931Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2944764Z ^ 2025-05-07T19:54:28.2948149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.2951423Z 2025-05-07T19:54:28.2952715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2954932Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2955854Z ^ 
2025-05-07T19:54:28.2959808Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.2963134Z 2025-05-07T19:54:28.2964399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2966594Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2967504Z ^ 2025-05-07T19:54:28.2970538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.2973255Z 2025-05-07T19:54:28.2974469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2976368Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2977287Z ^ 2025-05-07T19:54:28.2980498Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.2983544Z 2025-05-07T19:54:28.2984788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2986483Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2987356Z ^ 2025-05-07T19:54:28.2990844Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:28.2993820Z 2025-05-07T19:54:28.2995138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.2997146Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.2998082Z ^ 2025-05-07T19:54:28.3001329Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:28.3004492Z 2025-05-07T19:54:28.3006097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3008068Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3009124Z ^ 2025-05-07T19:54:28.3012593Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.3015914Z 2025-05-07T19:54:28.3017171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3019041Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3019968Z ^ 2025-05-07T19:54:28.3023317Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.3026464Z 2025-05-07T19:54:28.3027652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3029902Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3030822Z ^ 2025-05-07T19:54:28.3034382Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t 
*, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.3037677Z 2025-05-07T19:54:28.3039106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3041010Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3041841Z ^ 2025-05-07T19:54:28.3045266Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.3048416Z 2025-05-07T19:54:28.3050052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3052008Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3052886Z ^ 2025-05-07T19:54:28.3056569Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 
2025-05-07T19:54:28.3059824Z 2025-05-07T19:54:28.3061121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3063055Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3063969Z ^ 2025-05-07T19:54:28.3067379Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.3070409Z 2025-05-07T19:54:28.3071706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3073783Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3074845Z ^ 2025-05-07T19:54:28.3078388Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.3081498Z 2025-05-07T19:54:28.3082834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3084814Z reinterpret_cast(&lxu_cache_weights[cache_idx 
* max_D_cache]); 2025-05-07T19:54:28.3085735Z ^ 2025-05-07T19:54:28.3089302Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.3092760Z 2025-05-07T19:54:28.3094233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3096123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3096991Z ^ 2025-05-07T19:54:28.3100535Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.3103841Z 2025-05-07T19:54:28.3105142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3107143Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3108100Z ^ 2025-05-07T19:54:28.3111722Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, 
const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.3115081Z 2025-05-07T19:54:28.3116321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3118163Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3118920Z ^ 2025-05-07T19:54:28.3122640Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.3126039Z 2025-05-07T19:54:28.3127306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3129641Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3130685Z ^ 2025-05-07T19:54:28.3134146Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.3137371Z 2025-05-07T19:54:28.3138647Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3140983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3142029Z ^ 2025-05-07T19:54:28.3145632Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.3149200Z 2025-05-07T19:54:28.3150870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3153423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3154601Z ^ 2025-05-07T19:54:28.3154829Z 2025-05-07T19:54:28.3155262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.3155867Z 2025-05-07T19:54:28.3157358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3160009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3161192Z ^ 2025-05-07T19:54:28.3161558Z 2025-05-07T19:54:28.3162836Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3164834Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3165679Z ^ 2025-05-07T19:54:28.3168965Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.3172303Z 2025-05-07T19:54:28.3173602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3175660Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3176565Z ^ 2025-05-07T19:54:28.3179644Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.3182481Z 2025-05-07T19:54:28.3183838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3185636Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3186447Z ^ 
2025-05-07T19:54:28.3189628Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.3192502Z 2025-05-07T19:54:28.3193678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3195639Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3196442Z ^ 2025-05-07T19:54:28.3199524Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.3202450Z 2025-05-07T19:54:28.3203676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3205511Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3206443Z ^ 2025-05-07T19:54:28.3209870Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.3213078Z 2025-05-07T19:54:28.3214597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3216550Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3217338Z ^ 2025-05-07T19:54:28.3220393Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.3223232Z 2025-05-07T19:54:28.3224363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3226375Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3227155Z ^ 2025-05-07T19:54:28.3230432Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.3233625Z 2025-05-07T19:54:28.3234929Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3236664Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3237471Z ^ 2025-05-07T19:54:28.3240516Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.3243408Z 2025-05-07T19:54:28.3244487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3246087Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3246845Z ^ 2025-05-07T19:54:28.3249851Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.3252653Z 2025-05-07T19:54:28.3253791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3255541Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3256344Z ^ 
2025-05-07T19:54:28.3259861Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:28.3263076Z 2025-05-07T19:54:28.3264237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3269917Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3270892Z ^ 2025-05-07T19:54:28.3274301Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:28.3277570Z 2025-05-07T19:54:28.3278836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3280715Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3281614Z ^ 2025-05-07T19:54:28.3284662Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.3287688Z 2025-05-07T19:54:28.3288909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3290764Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3291717Z ^ 2025-05-07T19:54:28.3295082Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.3297919Z 2025-05-07T19:54:28.3299044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3300913Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3301705Z ^ 2025-05-07T19:54:28.3305075Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.3307911Z 2025-05-07T19:54:28.3309130Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3310979Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3312088Z ^ 2025-05-07T19:54:28.3315757Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.3319117Z 2025-05-07T19:54:28.3320276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3322149Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3323144Z ^ 2025-05-07T19:54:28.3326306Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:28.3329377Z 2025-05-07T19:54:28.3330544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3332402Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3333284Z ^ 
2025-05-07T19:54:28.3336487Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.3339730Z 2025-05-07T19:54:28.3341072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3343076Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3344012Z ^ 2025-05-07T19:54:28.3347518Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.3350949Z 2025-05-07T19:54:28.3352240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3354309Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3355806Z ^ 2025-05-07T19:54:28.3359257Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.3362608Z 2025-05-07T19:54:28.3363891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3365801Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3366693Z ^ 2025-05-07T19:54:28.3370112Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.3373234Z 2025-05-07T19:54:28.3374445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3376530Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3377377Z ^ 2025-05-07T19:54:28.3380852Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.3384056Z 2025-05-07T19:54:28.3385246Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3386995Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3387773Z ^ 2025-05-07T19:54:28.3391068Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.3394390Z 2025-05-07T19:54:28.3395801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3397782Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3398673Z ^ 2025-05-07T19:54:28.3402462Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.3405871Z 2025-05-07T19:54:28.3407147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3409123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:28.3410006Z ^ 2025-05-07T19:54:28.3413730Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.3417178Z 2025-05-07T19:54:28.3418822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3421517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3422670Z ^ 2025-05-07T19:54:28.3422953Z 2025-05-07T19:54:28.3423408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.3424078Z 2025-05-07T19:54:28.3425685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3428617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3429845Z ^ 2025-05-07T19:54:28.3430215Z 2025-05-07T19:54:28.3431450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3433326Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3434329Z ^ 2025-05-07T19:54:28.3437506Z 
detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.3440708Z 2025-05-07T19:54:28.3441962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3444049Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3445413Z ^ 2025-05-07T19:54:28.3448864Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.3452146Z 2025-05-07T19:54:28.3453313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3455360Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3456299Z ^ 2025-05-07T19:54:28.3459726Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.3462624Z 2025-05-07T19:54:28.3463706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3465619Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3466478Z ^ 2025-05-07T19:54:28.3469805Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.3473150Z 2025-05-07T19:54:28.3474481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3476500Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3477399Z ^ 2025-05-07T19:54:28.3480593Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.3483615Z 2025-05-07T19:54:28.3484741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:54:28.3486697Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3487634Z ^ 2025-05-07T19:54:28.3491495Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.3494732Z 2025-05-07T19:54:28.3496071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3498096Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3499039Z ^ 2025-05-07T19:54:28.3502628Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.3505976Z 2025-05-07T19:54:28.3507338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3509365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3510295Z ^ 2025-05-07T19:54:28.3514238Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.3517569Z 2025-05-07T19:54:28.3518680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3520375Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3521306Z ^ 2025-05-07T19:54:28.3524597Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.3527440Z 2025-05-07T19:54:28.3528858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3530667Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3531529Z ^ 2025-05-07T19:54:28.3535296Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 
2025-05-07T19:54:28.3538698Z 2025-05-07T19:54:28.3539921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3541842Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3542724Z ^ 2025-05-07T19:54:28.3546083Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:28.3549278Z 2025-05-07T19:54:28.3550418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3552179Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3552998Z ^ 2025-05-07T19:54:28.3556536Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.3559402Z 2025-05-07T19:54:28.3560511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3562250Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3563052Z ^ 2025-05-07T19:54:28.3566005Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.3568643Z 2025-05-07T19:54:28.3569774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3571664Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3572489Z ^ 2025-05-07T19:54:28.3576256Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.3579489Z 2025-05-07T19:54:28.3580738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3582616Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3583484Z ^ 2025-05-07T19:54:28.3586711Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.3589443Z 2025-05-07T19:54:28.3590573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3592262Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3592998Z ^ 2025-05-07T19:54:28.3596662Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:28.3600022Z 2025-05-07T19:54:28.3601357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3603394Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3604258Z ^ 2025-05-07T19:54:28.3607982Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.3611228Z 2025-05-07T19:54:28.3612498Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3614472Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3615374Z ^ 2025-05-07T19:54:28.3619140Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.3622642Z 2025-05-07T19:54:28.3623934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3626120Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3627066Z ^ 2025-05-07T19:54:28.3630916Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.3634437Z 2025-05-07T19:54:28.3635709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3637812Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:28.3638713Z ^ 2025-05-07T19:54:28.3642222Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.3645513Z 2025-05-07T19:54:28.3646798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3648795Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3649676Z ^ 2025-05-07T19:54:28.3653167Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.3656567Z 2025-05-07T19:54:28.3657874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3659904Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3660815Z ^ 2025-05-07T19:54:28.3664790Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const 
float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.3668163Z 2025-05-07T19:54:28.3669632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3671598Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3672532Z ^ 2025-05-07T19:54:28.3676226Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.3679563Z 2025-05-07T19:54:28.3680894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3682904Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3683835Z ^ 2025-05-07T19:54:28.3687432Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.3690769Z 2025-05-07T19:54:28.3692498Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3695236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3696450Z ^ 2025-05-07T19:54:28.3696710Z 2025-05-07T19:54:28.3697173Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:28.3697897Z 2025-05-07T19:54:28.3699615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3702372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3703588Z ^ 2025-05-07T19:54:28.3703990Z 2025-05-07T19:54:28.3705213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3707137Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3708011Z ^ 2025-05-07T19:54:28.3711650Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.3714627Z 2025-05-07T19:54:28.3715731Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3717570Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3718435Z ^ 2025-05-07T19:54:28.3721765Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.3724734Z 2025-05-07T19:54:28.3725867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3727554Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3728688Z ^ 2025-05-07T19:54:28.3732070Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.3735110Z 2025-05-07T19:54:28.3736036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3737841Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3738651Z ^ 
2025-05-07T19:54:28.3741850Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.3744782Z 2025-05-07T19:54:28.3746085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3747912Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3748825Z ^ 2025-05-07T19:54:28.3752490Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.3755875Z 2025-05-07T19:54:28.3757644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3759599Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3760535Z ^ 2025-05-07T19:54:28.3763945Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.3767099Z 2025-05-07T19:54:28.3768337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3770267Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3771185Z ^ 2025-05-07T19:54:28.3774500Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.3777830Z 2025-05-07T19:54:28.3779013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3780907Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3781776Z ^ 2025-05-07T19:54:28.3785160Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.3788267Z 2025-05-07T19:54:28.3789505Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3791395Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3792254Z ^ 2025-05-07T19:54:28.3795864Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.3798909Z 2025-05-07T19:54:28.3800345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3804654Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3805555Z ^ 2025-05-07T19:54:28.3809065Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:28.3812323Z 2025-05-07T19:54:28.3813609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3815746Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3816681Z ^ 
2025-05-07T19:54:28.3820225Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:28.3823464Z 2025-05-07T19:54:28.3824781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3826746Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3827654Z ^ 2025-05-07T19:54:28.3831068Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.3834130Z 2025-05-07T19:54:28.3835305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3837172Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3838024Z ^ 2025-05-07T19:54:28.3841577Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.3844591Z 2025-05-07T19:54:28.3845832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3848342Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3849201Z ^ 2025-05-07T19:54:28.3852608Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.3855737Z 2025-05-07T19:54:28.3857068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3858928Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3859889Z ^ 2025-05-07T19:54:28.3862988Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.3866040Z 2025-05-07T19:54:28.3867227Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3869121Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3869962Z ^ 2025-05-07T19:54:28.3873305Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:28.3876531Z 2025-05-07T19:54:28.3877766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3879658Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3880519Z ^ 2025-05-07T19:54:28.3884038Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.3887207Z 2025-05-07T19:54:28.3888359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3890476Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3891361Z ^ 
2025-05-07T19:54:28.3894392Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.3897395Z 2025-05-07T19:54:28.3898564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3900355Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3901156Z ^ 2025-05-07T19:54:28.3904348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.3907300Z 2025-05-07T19:54:28.3908487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3910344Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3911184Z ^ 2025-05-07T19:54:28.3914806Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.3918119Z 2025-05-07T19:54:28.3919378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3921291Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3922188Z ^ 2025-05-07T19:54:28.3925548Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.3929340Z 2025-05-07T19:54:28.3930542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3932421Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3933519Z ^ 2025-05-07T19:54:28.3936947Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.3940069Z 2025-05-07T19:54:28.3941307Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3943144Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3944085Z ^ 2025-05-07T19:54:28.3947308Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.3950304Z 2025-05-07T19:54:28.3951555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3953587Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3954632Z ^ 2025-05-07T19:54:28.3957881Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.3961024Z 2025-05-07T19:54:28.3962540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3965228Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3966454Z ^ 2025-05-07T19:54:28.3966720Z 2025-05-07T19:54:28.3967180Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:28.3967857Z 2025-05-07T19:54:28.3969555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.3972524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.3973683Z ^ 2025-05-07T19:54:28.3974024Z 2025-05-07T19:54:28.3975226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3977170Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3978025Z ^ 2025-05-07T19:54:28.3981331Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.3984285Z 2025-05-07T19:54:28.3985425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3987307Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3988215Z ^ 2025-05-07T19:54:28.3991787Z detected during 
instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.3995316Z 2025-05-07T19:54:28.3996628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.3998589Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.3999505Z ^ 2025-05-07T19:54:28.4003128Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.4006392Z 2025-05-07T19:54:28.4007649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4009654Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4010592Z ^ 2025-05-07T19:54:28.4014149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.4017203Z 2025-05-07T19:54:28.4018577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4020880Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4021727Z ^ 2025-05-07T19:54:28.4025298Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.4028224Z 2025-05-07T19:54:28.4029817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4031774Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4032597Z ^ 2025-05-07T19:54:28.4035895Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.4038865Z 2025-05-07T19:54:28.4040112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion 
resulted in a change of sign 2025-05-07T19:54:28.4042076Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4042952Z ^ 2025-05-07T19:54:28.4046271Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.4049268Z 2025-05-07T19:54:28.4050553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4052444Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4053371Z ^ 2025-05-07T19:54:28.4056767Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.4060006Z 2025-05-07T19:54:28.4061880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4063816Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4064900Z ^ 2025-05-07T19:54:28.4068293Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, 
const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.4071830Z 2025-05-07T19:54:28.4073145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4075265Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4076208Z ^ 2025-05-07T19:54:28.4079696Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:28.4082946Z 2025-05-07T19:54:28.4084241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4086243Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4087181Z ^ 2025-05-07T19:54:28.4090671Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 
2025-05-07T19:54:28.4093928Z 2025-05-07T19:54:28.4095179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4097135Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4098150Z ^ 2025-05-07T19:54:28.4101667Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.4104795Z 2025-05-07T19:54:28.4106229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4107873Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4108636Z ^ 2025-05-07T19:54:28.4111998Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.4115363Z 2025-05-07T19:54:28.4116615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4118779Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4119689Z ^ 2025-05-07T19:54:28.4123451Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.4126790Z 2025-05-07T19:54:28.4128109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4130441Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4131356Z ^ 2025-05-07T19:54:28.4134909Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.4138184Z 2025-05-07T19:54:28.4139486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4141429Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4142352Z ^ 2025-05-07T19:54:28.4145810Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:28.4149067Z 2025-05-07T19:54:28.4150739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4152704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4153630Z ^ 2025-05-07T19:54:28.4157477Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.4160787Z 2025-05-07T19:54:28.4162117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4164108Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4165031Z ^ 2025-05-07T19:54:28.4168849Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.4172231Z 
2025-05-07T19:54:28.4173609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4175658Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4176595Z ^ 2025-05-07T19:54:28.4180081Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.4183340Z 2025-05-07T19:54:28.4184626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4186648Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4187584Z ^ 2025-05-07T19:54:28.4191074Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.4194407Z 2025-05-07T19:54:28.4195645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4197854Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:54:28.4198813Z ^ 2025-05-07T19:54:28.4202421Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.4206053Z 2025-05-07T19:54:28.4207323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4209347Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4210064Z ^ 2025-05-07T19:54:28.4213226Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.4216273Z 2025-05-07T19:54:28.4217484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4219346Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4220256Z ^ 2025-05-07T19:54:28.4223631Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t 
*, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.4227018Z 2025-05-07T19:54:28.4228156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.4230339Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.4231176Z ^ 2025-05-07T19:54:28.4234736Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.4237777Z 2025-05-07T19:54:29.5474524Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5494937Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.1630955Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:30.1652268Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:54:31.7517200Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7537861Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.8911549Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8931595Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.9220618Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.9238669Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.9535146Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.9553581Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.9851670Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o 
-c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:31.9871888Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.0156724Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:32.0175420Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.0466212Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:32.0487005Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.0772461Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:32.0795939Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.0826596Z [194/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:37.0848570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0850921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0851931Z ^ 2025-05-07T19:54:37.0852194Z 2025-05-07T19:54:37.0852676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.0853409Z 2025-05-07T19:54:37.0854778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0857265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0858262Z ^ 2025-05-07T19:54:37.0858585Z 2025-05-07T19:54:37.0860014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0862501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0879022Z ^ 2025-05-07T19:54:37.0879336Z 2025-05-07T19:54:37.0879799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.0880417Z 2025-05-07T19:54:37.0881929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0884680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0885914Z ^ 2025-05-07T19:54:37.0886283Z 2025-05-07T19:54:37.0887817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0890159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0891384Z ^ 2025-05-07T19:54:37.0891614Z 2025-05-07T19:54:37.0891966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.0892559Z 2025-05-07T19:54:37.0894012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0896338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:54:37.0897551Z ^ 2025-05-07T19:54:37.0897845Z 2025-05-07T19:54:37.0899451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0901930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0902934Z ^ 2025-05-07T19:54:37.0903166Z 2025-05-07T19:54:37.0903576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.0904346Z 2025-05-07T19:54:37.0905810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0908209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0909313Z ^ 2025-05-07T19:54:37.0909709Z 2025-05-07T19:54:37.0911291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.0913780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0915025Z ^ 2025-05-07T19:54:37.0915283Z 2025-05-07T19:54:37.0915715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:37.0916334Z 2025-05-07T19:54:37.0917800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:54:37.0920732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:37.0921919Z ^ 2025-05-07T19:54:37.0922287Z 2025-05-07T19:54:37.4342207Z [195/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:37.4362754Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:38.5307272Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:38.5329014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5331711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5332747Z ^ 2025-05-07T19:54:38.5332992Z 2025-05-07T19:54:38.5333441Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.5334089Z 2025-05-07T19:54:38.5335739Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5338340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5339401Z ^ 2025-05-07T19:54:38.5339739Z 2025-05-07T19:54:38.5341148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5343416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5344410Z ^ 2025-05-07T19:54:38.5344636Z 2025-05-07T19:54:38.5345020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.5345630Z 2025-05-07T19:54:38.5347125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5349762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5350814Z ^ 2025-05-07T19:54:38.5351143Z 2025-05-07T19:54:38.5352487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5355012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5356070Z ^ 
2025-05-07T19:54:38.5356311Z 2025-05-07T19:54:38.5356740Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.5357389Z 2025-05-07T19:54:38.5359443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5362010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5363398Z ^ 2025-05-07T19:54:38.5363752Z 2025-05-07T19:54:38.5365313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5367739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5368854Z ^ 2025-05-07T19:54:38.5369111Z 2025-05-07T19:54:38.5369596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.5370226Z 2025-05-07T19:54:38.5371898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5374299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5375440Z ^ 2025-05-07T19:54:38.5375776Z 2025-05-07T19:54:38.5377279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:38.5379751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5380879Z ^ 2025-05-07T19:54:38.5381156Z 2025-05-07T19:54:38.5381614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:38.5382297Z 2025-05-07T19:54:38.5384028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.5386763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:38.5387986Z ^ 2025-05-07T19:54:38.5388355Z 2025-05-07T19:54:41.4286987Z [197/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:41.4310124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4312497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4313681Z ^ 2025-05-07T19:54:41.4313954Z 
2025-05-07T19:54:41.4314536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.4315214Z 2025-05-07T19:54:41.4316892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4319529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4320699Z ^ 2025-05-07T19:54:41.4320997Z 2025-05-07T19:54:41.4322409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4324659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4325713Z ^ 2025-05-07T19:54:41.4325937Z 2025-05-07T19:54:41.4326385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.4326970Z 2025-05-07T19:54:41.4328778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4331372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4332490Z ^ 2025-05-07T19:54:41.4332820Z 2025-05-07T19:54:41.4334395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4337314Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4338383Z ^ 2025-05-07T19:54:41.4338641Z 2025-05-07T19:54:41.4339012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.4339568Z 2025-05-07T19:54:41.4341276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4343850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4344989Z ^ 2025-05-07T19:54:41.4345333Z 2025-05-07T19:54:41.4346925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4349415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4350560Z ^ 2025-05-07T19:54:41.4350823Z 2025-05-07T19:54:41.4351267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.4351939Z 2025-05-07T19:54:41.4353641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4356331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4357672Z ^ 2025-05-07T19:54:41.4358060Z 2025-05-07T19:54:41.4359776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4362539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4364033Z ^ 2025-05-07T19:54:41.4364311Z 2025-05-07T19:54:41.4364754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.4365425Z 2025-05-07T19:54:41.4366938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.4369375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.4370585Z ^ 2025-05-07T19:54:41.4370939Z 2025-05-07T19:54:41.6849846Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:41.6873211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6875973Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6877074Z ^ 2025-05-07T19:54:41.6877366Z 2025-05-07T19:54:41.6877785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.6878362Z 2025-05-07T19:54:41.6879908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6882595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6884016Z ^ 2025-05-07T19:54:41.6884399Z 2025-05-07T19:54:41.6886127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6888218Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6888846Z ^ 2025-05-07T19:54:41.6889181Z 2025-05-07T19:54:41.6890830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6892940Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6893550Z ^ 2025-05-07T19:54:41.6893862Z 2025-05-07T19:54:41.6895537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6897955Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6898578Z ^ 2025-05-07T19:54:41.6898893Z 2025-05-07T19:54:41.6900644Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6903740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6904950Z ^ 2025-05-07T19:54:41.6905208Z 2025-05-07T19:54:41.6905669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.6906364Z 2025-05-07T19:54:41.6907968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6910268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6911427Z ^ 2025-05-07T19:54:41.6911814Z 2025-05-07T19:54:41.6913254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6915124Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6915598Z ^ 2025-05-07T19:54:41.6915857Z 2025-05-07T19:54:41.6917117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6918916Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6919483Z ^ 2025-05-07T19:54:41.6919767Z 2025-05-07T19:54:41.6921180Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6923241Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6923747Z ^ 2025-05-07T19:54:41.6924004Z 2025-05-07T19:54:41.6925556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6927900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6929292Z ^ 2025-05-07T19:54:41.6929539Z 2025-05-07T19:54:41.6929916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.6930550Z 2025-05-07T19:54:41.6932303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6934933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6936104Z ^ 2025-05-07T19:54:41.6936469Z 2025-05-07T19:54:41.6937806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6940032Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6940537Z ^ 2025-05-07T19:54:41.6940796Z 2025-05-07T19:54:41.6942146Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6944122Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6944611Z ^ 2025-05-07T19:54:41.6944864Z 2025-05-07T19:54:41.6946248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6947924Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6948469Z ^ 2025-05-07T19:54:41.6948767Z 2025-05-07T19:54:41.6950350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6952690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6953716Z ^ 2025-05-07T19:54:41.6954290Z 2025-05-07T19:54:41.6954712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.6955422Z 2025-05-07T19:54:41.6957141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6959915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6961148Z ^ 2025-05-07T19:54:41.6961540Z 2025-05-07T19:54:41.6963111Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6964964Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6965520Z ^ 2025-05-07T19:54:41.6965826Z 2025-05-07T19:54:41.6967293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6969192Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6969780Z ^ 2025-05-07T19:54:41.6970068Z 2025-05-07T19:54:41.6971489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6973451Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6974187Z ^ 2025-05-07T19:54:41.6974496Z 2025-05-07T19:54:41.6975984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.6978602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6979804Z ^ 2025-05-07T19:54:41.6980031Z 2025-05-07T19:54:41.6980501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.6981156Z 2025-05-07T19:54:41.6983041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:41.6985463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.6986682Z ^ 2025-05-07T19:54:41.6987017Z 2025-05-07T19:54:41.6988482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6990337Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6990890Z ^ 2025-05-07T19:54:41.6991224Z 2025-05-07T19:54:41.6992719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6994649Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6995152Z ^ 2025-05-07T19:54:41.6995432Z 2025-05-07T19:54:41.6997104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.6998999Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.6999576Z ^ 2025-05-07T19:54:41.6999870Z 2025-05-07T19:54:42.1973985Z [199/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.1992746Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.0124325Z [200/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.0143258Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.0414869Z [201/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:45.0438312Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.3854540Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.3873183Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.8257474Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:45.8278375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:45.9728901Z [204/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:45.9751504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9754775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9755889Z ^ 2025-05-07T19:54:45.9756127Z 2025-05-07T19:54:45.9756541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:45.9757170Z 2025-05-07T19:54:45.9758765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9761222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9762359Z ^ 2025-05-07T19:54:45.9762704Z 2025-05-07T19:54:45.9764193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9766660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9767757Z ^ 2025-05-07T19:54:45.9768002Z 2025-05-07T19:54:45.9768436Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T19:54:45.9769048Z 2025-05-07T19:54:45.9770563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9773255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9774397Z ^ 2025-05-07T19:54:45.9774766Z 2025-05-07T19:54:45.9776275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9778874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9780008Z ^ 2025-05-07T19:54:45.9780279Z 2025-05-07T19:54:45.9780722Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:45.9781379Z 2025-05-07T19:54:45.9782922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9785429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9786709Z ^ 2025-05-07T19:54:45.9787041Z 2025-05-07T19:54:45.9788544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9790917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9791948Z ^ 2025-05-07T19:54:45.9792183Z 2025-05-07T19:54:45.9792989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:45.9793601Z 2025-05-07T19:54:45.9795213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9797666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9798711Z ^ 2025-05-07T19:54:45.9799065Z 2025-05-07T19:54:45.9800542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9803228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9804325Z ^ 2025-05-07T19:54:45.9804608Z 2025-05-07T19:54:45.9805044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:45.9805695Z 2025-05-07T19:54:45.9807180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.9809622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:45.9810740Z ^ 2025-05-07T19:54:45.9811059Z 2025-05-07T19:54:46.7029634Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:46.7050174Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:46.7230435Z [206/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:46.7255806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7258599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7259792Z ^ 2025-05-07T19:54:46.7260064Z 2025-05-07T19:54:46.7260513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.7261195Z 2025-05-07T19:54:46.7262920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7265651Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7266862Z ^ 2025-05-07T19:54:46.7267224Z 2025-05-07T19:54:46.7269122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7271842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7273114Z ^ 2025-05-07T19:54:46.7273358Z 2025-05-07T19:54:46.7273795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.7274571Z 2025-05-07T19:54:46.7276260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7279135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7280350Z ^ 2025-05-07T19:54:46.7280733Z 2025-05-07T19:54:46.7282569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7285321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7286504Z ^ 2025-05-07T19:54:46.7286754Z 2025-05-07T19:54:46.7287218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.7288038Z 2025-05-07T19:54:46.7289740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7292529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7293741Z ^ 2025-05-07T19:54:46.7294103Z 2025-05-07T19:54:46.7295792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7298529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7299726Z ^ 2025-05-07T19:54:46.7299968Z 2025-05-07T19:54:46.7300411Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.7301095Z 2025-05-07T19:54:46.7302811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7305548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7306743Z ^ 2025-05-07T19:54:46.7307114Z 2025-05-07T19:54:46.7309119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7311901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7313127Z ^ 2025-05-07T19:54:46.7313380Z 2025-05-07T19:54:46.7313850Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:54:46.7314679Z 2025-05-07T19:54:46.7319996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.7322600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.7323822Z ^ 2025-05-07T19:54:46.7324169Z 2025-05-07T19:54:47.0727198Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:47.0744715Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:47.9992906Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:48.0010789Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:48.9087299Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:48.9108000Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:51.0099333Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:51.0118672Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.0428117Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.0448600Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.7576105Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:54:54.7594733Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.3245882Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:54:55.3265073Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.7377687Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:54:56.7396942Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.7148776Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:54:57.7167036Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.3108397Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:54:58.3126584Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.6195540Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:54:58.6223008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.9871481Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:54:58.9891530Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.2344632Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:54:59.2364971Z clang-16: warning: 
argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.2991772Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:59.3012574Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.8355533Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:54:59.8376088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.9413025Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:00.0111484Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.0175154Z [223/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:55:00.0738705Z [224/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:00.0759717Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.0902538Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:00.0922628Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.5459700Z [226/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:00.5482890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5485550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5486703Z ^ 2025-05-07T19:55:00.5486947Z 2025-05-07T19:55:00.5487314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:00.5487943Z 
2025-05-07T19:55:00.5489491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5492066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5493483Z ^ 2025-05-07T19:55:00.5493853Z 2025-05-07T19:55:00.5495441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5498070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5499217Z ^ 2025-05-07T19:55:00.5499476Z 2025-05-07T19:55:00.5499897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:00.5500540Z 2025-05-07T19:55:00.5502239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5504772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5505967Z ^ 2025-05-07T19:55:00.5506287Z 2025-05-07T19:55:00.5507825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5510457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:55:00.5511670Z ^ 2025-05-07T19:55:00.5511931Z 2025-05-07T19:55:00.5512391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:00.5513068Z 2025-05-07T19:55:00.5515087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5517770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5519042Z ^ 2025-05-07T19:55:00.5519429Z 2025-05-07T19:55:00.5521097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5523824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5525012Z ^ 2025-05-07T19:55:00.5525301Z 2025-05-07T19:55:00.5525744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:00.5526419Z 2025-05-07T19:55:00.5527977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5530624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5531766Z ^ 2025-05-07T19:55:00.5532129Z 2025-05-07T19:55:00.5533769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:55:00.5536489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5537638Z ^ 2025-05-07T19:55:00.5538116Z 2025-05-07T19:55:00.5538560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:00.5539253Z 2025-05-07T19:55:00.5540881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:00.5543541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:00.5544639Z ^ 2025-05-07T19:55:00.5545019Z 2025-05-07T19:55:00.5915583Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:00.5934429Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.6217109Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:00.6237303Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.4484998Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:01.4507009Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.9811836Z [230/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:01.9836267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9838851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9840029Z ^ 2025-05-07T19:55:01.9840267Z 2025-05-07T19:55:01.9840724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9841440Z 2025-05-07T19:55:01.9843087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:01.9845935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9847113Z ^ 2025-05-07T19:55:01.9847499Z 2025-05-07T19:55:01.9849116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9851130Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9851689Z ^ 2025-05-07T19:55:01.9852000Z 2025-05-07T19:55:01.9853888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9855826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9856358Z ^ 2025-05-07T19:55:01.9856663Z 2025-05-07T19:55:01.9858170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9860045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9860557Z ^ 2025-05-07T19:55:01.9860858Z 2025-05-07T19:55:01.9862523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9865098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9866470Z ^ 2025-05-07T19:55:01.9866737Z 2025-05-07T19:55:01.9867217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9867876Z 
2025-05-07T19:55:01.9869599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9872145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9873343Z ^ 2025-05-07T19:55:01.9873709Z 2025-05-07T19:55:01.9875828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9877843Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9878535Z ^ 2025-05-07T19:55:01.9878876Z 2025-05-07T19:55:01.9880470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9882469Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9883029Z ^ 2025-05-07T19:55:01.9883354Z 2025-05-07T19:55:01.9884931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9886946Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9887675Z ^ 2025-05-07T19:55:01.9887988Z 2025-05-07T19:55:01.9889639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9892274Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9893378Z ^ 2025-05-07T19:55:01.9893608Z 2025-05-07T19:55:01.9894000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9894626Z 2025-05-07T19:55:01.9896164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9898931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9900094Z ^ 2025-05-07T19:55:01.9900488Z 2025-05-07T19:55:01.9901948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9903879Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9904420Z ^ 2025-05-07T19:55:01.9904711Z 2025-05-07T19:55:01.9906264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9908175Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9908784Z ^ 2025-05-07T19:55:01.9909089Z 2025-05-07T19:55:01.9910718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9912743Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9913328Z ^ 2025-05-07T19:55:01.9913639Z 2025-05-07T19:55:01.9915755Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9919960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9921258Z ^ 2025-05-07T19:55:01.9921531Z 2025-05-07T19:55:01.9922001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9922720Z 2025-05-07T19:55:01.9924453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9927140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9928579Z ^ 2025-05-07T19:55:01.9928985Z 2025-05-07T19:55:01.9930564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9932630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9933239Z ^ 2025-05-07T19:55:01.9933551Z 2025-05-07T19:55:01.9935160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9937236Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9937992Z ^ 2025-05-07T19:55:01.9938296Z 2025-05-07T19:55:01.9939927Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9941951Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9942528Z ^ 2025-05-07T19:55:01.9942987Z 2025-05-07T19:55:01.9944663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9947386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9948593Z ^ 2025-05-07T19:55:01.9948850Z 2025-05-07T19:55:01.9949301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9950002Z 2025-05-07T19:55:01.9951702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:01.9954576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:01.9955784Z ^ 2025-05-07T19:55:01.9956157Z 2025-05-07T19:55:01.9957793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9959779Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9960352Z ^ 2025-05-07T19:55:01.9960646Z 2025-05-07T19:55:01.9962245Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9964229Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9965114Z ^ 2025-05-07T19:55:01.9965423Z 2025-05-07T19:55:01.9967281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9969346Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9970001Z ^ 2025-05-07T19:55:01.9970311Z 2025-05-07T19:55:03.4866638Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:03.4887493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:04.5182655Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:04.5202876Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.2253350Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:08.2271672Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.0257458Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:10.0278402Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.3078183Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:10.3098509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:13.5546247Z [236/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:13.5578300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5581939Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5583449Z ^ 2025-05-07T19:55:13.5583771Z 2025-05-07T19:55:13.5584342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5585467Z 2025-05-07T19:55:13.5587580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5590927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5592485Z ^ 2025-05-07T19:55:13.5592933Z 2025-05-07T19:55:13.5596636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5599483Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5600419Z ^ 2025-05-07T19:55:13.5600835Z 2025-05-07T19:55:13.5602910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5605394Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5606094Z ^ 2025-05-07T19:55:13.5606505Z 2025-05-07T19:55:13.5608433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5610972Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5611703Z ^ 2025-05-07T19:55:13.5612063Z 2025-05-07T19:55:13.5614189Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5616929Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5617658Z ^ 2025-05-07T19:55:13.5618018Z 2025-05-07T19:55:13.5620191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5623527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5625226Z ^ 2025-05-07T19:55:13.5625697Z 2025-05-07T19:55:13.5626265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5627108Z 2025-05-07T19:55:13.5629386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5632847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5634461Z ^ 2025-05-07T19:55:13.5634962Z 2025-05-07T19:55:13.5636995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5639856Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5640819Z ^ 2025-05-07T19:55:13.5641182Z 2025-05-07T19:55:13.5643074Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5645449Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5646185Z ^ 2025-05-07T19:55:13.5646526Z 2025-05-07T19:55:13.5648448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5650802Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5651522Z ^ 2025-05-07T19:55:13.5651843Z 2025-05-07T19:55:13.5654026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5656632Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5657355Z ^ 2025-05-07T19:55:13.5657791Z 2025-05-07T19:55:13.5660073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5663046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5664286Z ^ 2025-05-07T19:55:13.5664619Z 2025-05-07T19:55:13.5665227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5666147Z 2025-05-07T19:55:13.5668550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:13.5672064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5673614Z ^ 2025-05-07T19:55:13.5674197Z 2025-05-07T19:55:13.5676107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5679053Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5680071Z ^ 2025-05-07T19:55:13.5680450Z 2025-05-07T19:55:13.5682571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5685431Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5686216Z ^ 2025-05-07T19:55:13.5686584Z 2025-05-07T19:55:13.5688571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5691069Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5691793Z ^ 2025-05-07T19:55:13.5692137Z 2025-05-07T19:55:13.5694113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5696742Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5697368Z ^ 2025-05-07T19:55:13.5697719Z 2025-05-07T19:55:13.5699905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:55:13.5703222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5704708Z ^ 2025-05-07T19:55:13.5705042Z 2025-05-07T19:55:13.5705603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5706406Z 2025-05-07T19:55:13.5708554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5712040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5713541Z ^ 2025-05-07T19:55:13.5713975Z 2025-05-07T19:55:13.5716069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5718781Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5719761Z ^ 2025-05-07T19:55:13.5720125Z 2025-05-07T19:55:13.5722233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5724798Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5725637Z ^ 2025-05-07T19:55:13.5726172Z 2025-05-07T19:55:13.5728203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5730915Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:55:13.5731618Z ^ 2025-05-07T19:55:13.5732005Z 2025-05-07T19:55:13.5733970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5736504Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5737224Z ^ 2025-05-07T19:55:13.5737583Z 2025-05-07T19:55:13.5739809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5743598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5745221Z ^ 2025-05-07T19:55:13.5745728Z 2025-05-07T19:55:13.5746330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5747197Z 2025-05-07T19:55:13.5749342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.5752605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.5754247Z ^ 2025-05-07T19:55:13.5754697Z 2025-05-07T19:55:13.5756576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5759187Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5760119Z ^ 2025-05-07T19:55:13.5760484Z 2025-05-07T19:55:13.5762439Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5764999Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5765712Z ^ 2025-05-07T19:55:13.5766100Z 2025-05-07T19:55:13.5768500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5770581Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5771326Z ^ 2025-05-07T19:55:13.5771805Z 2025-05-07T19:55:13.5774007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5776707Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5777426Z ^ 2025-05-07T19:55:13.5777798Z 2025-05-07T19:55:14.9741752Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:14.9754801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9756247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9756919Z ^ 2025-05-07T19:55:14.9757074Z 2025-05-07T19:55:14.9757352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9757725Z 2025-05-07T19:55:14.9758782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9760222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9761012Z ^ 2025-05-07T19:55:14.9761227Z 2025-05-07T19:55:14.9762054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9763235Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.9763690Z ^ 2025-05-07T19:55:14.9763857Z 2025-05-07T19:55:14.9764767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9765805Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9766162Z ^ 2025-05-07T19:55:14.9766329Z 2025-05-07T19:55:14.9767145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:14.9768208Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9768556Z ^ 2025-05-07T19:55:14.9768721Z 2025-05-07T19:55:14.9769537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9770646Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9770968Z ^ 2025-05-07T19:55:14.9771156Z 2025-05-07T19:55:14.9772043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9773503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9774150Z ^ 2025-05-07T19:55:14.9774329Z 2025-05-07T19:55:14.9774589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9774957Z 2025-05-07T19:55:14.9775879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9777316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9778002Z ^ 2025-05-07T19:55:14.9778214Z 2025-05-07T19:55:14.9779056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9780199Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.9780667Z ^ 
2025-05-07T19:55:14.9780834Z 2025-05-07T19:55:14.9781643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9782781Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9783129Z ^ 2025-05-07T19:55:14.9783291Z 2025-05-07T19:55:14.9784096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9785186Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9785502Z ^ 2025-05-07T19:55:14.9785684Z 2025-05-07T19:55:14.9786637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9787658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9787966Z ^ 2025-05-07T19:55:14.9788148Z 2025-05-07T19:55:14.9789006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9790420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9791047Z ^ 2025-05-07T19:55:14.9791195Z 2025-05-07T19:55:14.9791458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9791814Z 2025-05-07T19:55:14.9792675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:55:14.9794219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9794937Z ^ 2025-05-07T19:55:14.9795141Z 2025-05-07T19:55:14.9795934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9797061Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.9797502Z ^ 2025-05-07T19:55:14.9797665Z 2025-05-07T19:55:14.9798456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9799671Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9799991Z ^ 2025-05-07T19:55:14.9800178Z 2025-05-07T19:55:14.9800996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9802057Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9802378Z ^ 2025-05-07T19:55:14.9802535Z 2025-05-07T19:55:14.9803365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9804394Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9804728Z ^ 2025-05-07T19:55:14.9804886Z 2025-05-07T19:55:14.9805851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9807274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9807945Z ^ 2025-05-07T19:55:14.9808091Z 2025-05-07T19:55:14.9808344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9808775Z 2025-05-07T19:55:14.9809660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9811101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9811754Z ^ 2025-05-07T19:55:14.9811988Z 2025-05-07T19:55:14.9812807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9813962Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.9814389Z ^ 2025-05-07T19:55:14.9814577Z 2025-05-07T19:55:14.9815382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9816566Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9816877Z ^ 2025-05-07T19:55:14.9817037Z 2025-05-07T19:55:14.9817845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9818909Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9819238Z ^ 2025-05-07T19:55:14.9819561Z 2025-05-07T19:55:14.9820370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9821376Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9821707Z ^ 2025-05-07T19:55:14.9821860Z 2025-05-07T19:55:14.9822715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9824114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9824760Z ^ 2025-05-07T19:55:14.9824903Z 2025-05-07T19:55:14.9825147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.9825524Z 2025-05-07T19:55:14.9826387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.9827792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:14.9828694Z ^ 2025-05-07T19:55:14.9828898Z 2025-05-07T19:55:14.9829721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9830965Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.9831407Z ^ 2025-05-07T19:55:14.9831567Z 
2025-05-07T19:55:14.9832386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9833449Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9833784Z ^ 2025-05-07T19:55:14.9833946Z 2025-05-07T19:55:14.9834916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9836050Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9836386Z ^ 2025-05-07T19:55:14.9836543Z 2025-05-07T19:55:14.9837356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.9838415Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.9838759Z ^ 2025-05-07T19:55:14.9838916Z 2025-05-07T19:55:16.1430406Z [238/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:16.1454059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:55:16.1478562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1480218Z ^ 2025-05-07T19:55:16.1480477Z 2025-05-07T19:55:16.1480915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:16.1481649Z 2025-05-07T19:55:16.1483302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1486066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1487277Z ^ 2025-05-07T19:55:16.1487692Z 2025-05-07T19:55:16.1489377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1492440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1493592Z ^ 2025-05-07T19:55:16.1493879Z 2025-05-07T19:55:16.1494314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:16.1494985Z 2025-05-07T19:55:16.1496641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1499220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1500563Z ^ 2025-05-07T19:55:16.1500919Z 2025-05-07T19:55:16.1502671Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1505341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1506712Z ^ 2025-05-07T19:55:16.1506979Z 2025-05-07T19:55:16.1507422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:16.1508117Z 2025-05-07T19:55:16.1509835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1512571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1513768Z ^ 2025-05-07T19:55:16.1514338Z 2025-05-07T19:55:16.1516015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1518742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1519895Z ^ 2025-05-07T19:55:16.1520194Z 2025-05-07T19:55:16.1520666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:16.1521379Z 2025-05-07T19:55:16.1523277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1525896Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1527426Z ^ 2025-05-07T19:55:16.1527765Z 2025-05-07T19:55:16.1529865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1532539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1533703Z ^ 2025-05-07T19:55:16.1533971Z 2025-05-07T19:55:16.1534450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:16.1535145Z 2025-05-07T19:55:16.1536790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:16.1539462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:16.1540630Z ^ 2025-05-07T19:55:16.1540996Z 2025-05-07T19:55:22.7966355Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:22.7989303Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.7992077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.7993135Z ^ 2025-05-07T19:55:22.7993356Z 2025-05-07T19:55:22.7993788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.7994538Z 2025-05-07T19:55:22.7996080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.7998549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.7999709Z ^ 2025-05-07T19:55:22.8000046Z 2025-05-07T19:55:22.8001523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8003495Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.8004195Z ^ 2025-05-07T19:55:22.8004482Z 2025-05-07T19:55:22.8005896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8008132Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8008632Z ^ 2025-05-07T19:55:22.8008949Z 2025-05-07T19:55:22.8010210Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8011979Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8012492Z ^ 2025-05-07T19:55:22.8012766Z 2025-05-07T19:55:22.8014301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8016184Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8016774Z ^ 2025-05-07T19:55:22.8017051Z 2025-05-07T19:55:22.8018795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8021151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8022256Z ^ 2025-05-07T19:55:22.8022509Z 2025-05-07T19:55:22.8022942Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.8023601Z 2025-05-07T19:55:22.8025140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8027719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8029091Z ^ 2025-05-07T19:55:22.8029471Z 2025-05-07T19:55:22.8030941Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8033126Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.8033841Z ^ 2025-05-07T19:55:22.8034164Z 2025-05-07T19:55:22.8035682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8037473Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8038027Z ^ 2025-05-07T19:55:22.8038289Z 2025-05-07T19:55:22.8039745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8041580Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8042121Z ^ 2025-05-07T19:55:22.8042376Z 2025-05-07T19:55:22.8043858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8045715Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8046292Z ^ 2025-05-07T19:55:22.8046541Z 2025-05-07T19:55:22.8048075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8050802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8051994Z ^ 
2025-05-07T19:55:22.8052226Z 2025-05-07T19:55:22.8052632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.8053271Z 2025-05-07T19:55:22.8054852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8057623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8058806Z ^ 2025-05-07T19:55:22.8059184Z 2025-05-07T19:55:22.8060689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8062723Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.8063496Z ^ 2025-05-07T19:55:22.8063803Z 2025-05-07T19:55:22.8065368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8067531Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8068110Z ^ 2025-05-07T19:55:22.8068373Z 2025-05-07T19:55:22.8070027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8071955Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8072518Z ^ 2025-05-07T19:55:22.8072803Z 2025-05-07T19:55:22.8074495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8076241Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8076716Z ^ 2025-05-07T19:55:22.8076992Z 2025-05-07T19:55:22.8078493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8081020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8082140Z ^ 2025-05-07T19:55:22.8082417Z 2025-05-07T19:55:22.8082842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.8083421Z 2025-05-07T19:55:22.8085106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8087887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8089036Z ^ 2025-05-07T19:55:22.8089376Z 2025-05-07T19:55:22.8090961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8093202Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.8093969Z ^ 2025-05-07T19:55:22.8094224Z 2025-05-07T19:55:22.8095780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8097554Z if (output_d >= 
0 && output_d < D) { 2025-05-07T19:55:22.8098136Z ^ 2025-05-07T19:55:22.8098430Z 2025-05-07T19:55:22.8100004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8101828Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8102391Z ^ 2025-05-07T19:55:22.8102675Z 2025-05-07T19:55:22.8104170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8106000Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8106537Z ^ 2025-05-07T19:55:22.8106806Z 2025-05-07T19:55:22.8108285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8110851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8111968Z ^ 2025-05-07T19:55:22.8112216Z 2025-05-07T19:55:22.8112924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:22.8113592Z 2025-05-07T19:55:22.8115442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.8118140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:22.8119315Z ^ 2025-05-07T19:55:22.8119690Z 2025-05-07T19:55:22.8121357Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8123458Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:22.8124226Z ^ 2025-05-07T19:55:22.8124496Z 2025-05-07T19:55:22.8125966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8127987Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8128819Z ^ 2025-05-07T19:55:22.8129104Z 2025-05-07T19:55:22.8130611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8132516Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8133032Z ^ 2025-05-07T19:55:22.8133280Z 2025-05-07T19:55:22.8134693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.8136779Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:22.8137345Z ^ 2025-05-07T19:55:22.8137632Z 2025-05-07T19:55:31.7074973Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:31.7099742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7102687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7104070Z ^ 2025-05-07T19:55:31.7104350Z 2025-05-07T19:55:31.7104825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:31.7105470Z 2025-05-07T19:55:31.7107046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7109694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7111425Z ^ 2025-05-07T19:55:31.7111803Z 2025-05-07T19:55:31.7113229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7115500Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:31.7116209Z ^ 2025-05-07T19:55:31.7116507Z 2025-05-07T19:55:31.7118065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless 
comparison of unsigned integer with zero 2025-05-07T19:55:31.7119908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7120413Z ^ 2025-05-07T19:55:31.7120669Z 2025-05-07T19:55:31.7122187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7124163Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7124762Z ^ 2025-05-07T19:55:31.7125056Z 2025-05-07T19:55:31.7126693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7129012Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7129610Z ^ 2025-05-07T19:55:31.7129899Z 2025-05-07T19:55:31.7131470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7134316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7135529Z ^ 2025-05-07T19:55:31.7135771Z 2025-05-07T19:55:31.7136193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:31.7136969Z 2025-05-07T19:55:31.7138564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7141349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7142479Z ^ 
2025-05-07T19:55:31.7142862Z 2025-05-07T19:55:31.7144398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7146491Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:31.7147257Z ^ 2025-05-07T19:55:31.7147592Z 2025-05-07T19:55:31.7149180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7151206Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7151807Z ^ 2025-05-07T19:55:31.7152116Z 2025-05-07T19:55:31.7153687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7156051Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7156613Z ^ 2025-05-07T19:55:31.7156891Z 2025-05-07T19:55:31.7158317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7160239Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7160816Z ^ 2025-05-07T19:55:31.7161118Z 2025-05-07T19:55:31.7162764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7165440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:55:31.7166688Z ^ 2025-05-07T19:55:31.7166968Z 2025-05-07T19:55:31.7167412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:31.7168079Z 2025-05-07T19:55:31.7169700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7172487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7173751Z ^ 2025-05-07T19:55:31.7174137Z 2025-05-07T19:55:31.7175701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7178179Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:31.7178992Z ^ 2025-05-07T19:55:31.7179286Z 2025-05-07T19:55:31.7180892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7183027Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7183620Z ^ 2025-05-07T19:55:31.7183906Z 2025-05-07T19:55:31.7185484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7187408Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7188001Z ^ 2025-05-07T19:55:31.7188290Z 2025-05-07T19:55:31.7189866Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7191866Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7192429Z ^ 2025-05-07T19:55:31.7192752Z 2025-05-07T19:55:31.7194489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7197121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7198278Z ^ 2025-05-07T19:55:31.7198546Z 2025-05-07T19:55:31.7198926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:31.7199579Z 2025-05-07T19:55:31.7201268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7204100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7205221Z ^ 2025-05-07T19:55:31.7205561Z 2025-05-07T19:55:31.7207045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7209124Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:31.7209931Z ^ 2025-05-07T19:55:31.7210244Z 2025-05-07T19:55:31.7211837Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7213924Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7214502Z ^ 2025-05-07T19:55:31.7214810Z 2025-05-07T19:55:31.7216363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7218252Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7218795Z ^ 2025-05-07T19:55:31.7219109Z 2025-05-07T19:55:31.7220644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7222941Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7223532Z ^ 2025-05-07T19:55:31.7223855Z 2025-05-07T19:55:31.7225563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7228699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7230241Z ^ 2025-05-07T19:55:31.7230523Z 2025-05-07T19:55:31.7231031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:31.7231695Z 2025-05-07T19:55:31.7233496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7236452Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:31.7237571Z ^ 2025-05-07T19:55:31.7237881Z 2025-05-07T19:55:31.7239202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7241335Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:31.7242109Z ^ 2025-05-07T19:55:31.7242388Z 2025-05-07T19:55:31.7243936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7246077Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7246655Z ^ 2025-05-07T19:55:31.7246971Z 2025-05-07T19:55:31.7248372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7250293Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7250813Z ^ 2025-05-07T19:55:31.7251038Z 2025-05-07T19:55:31.7252534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7254386Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:31.7254970Z ^ 2025-05-07T19:55:31.7255239Z 2025-05-07T19:55:45.0681890Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:45.0705857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0708760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0710017Z ^ 2025-05-07T19:55:45.0710313Z 2025-05-07T19:55:45.0710833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:45.0711866Z 2025-05-07T19:55:45.0713774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0716655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0717857Z ^ 2025-05-07T19:55:45.0718230Z 2025-05-07T19:55:45.0719835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0722518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0723763Z ^ 2025-05-07T19:55:45.0724032Z 
2025-05-07T19:55:45.0724501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:45.0725218Z 2025-05-07T19:55:45.0726948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0729864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0731056Z ^ 2025-05-07T19:55:45.0731577Z 2025-05-07T19:55:45.0733453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0736575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0737777Z ^ 2025-05-07T19:55:45.0738039Z 2025-05-07T19:55:45.0738521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:45.0739361Z 2025-05-07T19:55:45.0741092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0743857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0745106Z ^ 2025-05-07T19:55:45.0745482Z 2025-05-07T19:55:45.0747188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0750024Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0751365Z ^ 2025-05-07T19:55:45.0751597Z 2025-05-07T19:55:45.0752016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:45.0752683Z 2025-05-07T19:55:45.0754646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0757298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0758517Z ^ 2025-05-07T19:55:45.0759112Z 2025-05-07T19:55:45.0760830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0763554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0764742Z ^ 2025-05-07T19:55:45.0765009Z 2025-05-07T19:55:45.0765483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:45.0766147Z 2025-05-07T19:55:45.0767786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:45.0770734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:45.0771988Z ^ 2025-05-07T19:55:45.0772329Z 2025-05-07T19:55:47.8963123Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:47.8984687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.8987442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.8988557Z ^ 2025-05-07T19:55:47.8988846Z 2025-05-07T19:55:47.8989266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:47.8989914Z 2025-05-07T19:55:47.8991459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.8993761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.8995014Z ^ 2025-05-07T19:55:47.8995382Z 2025-05-07T19:55:47.8997014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:47.8999139Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.8999712Z ^ 2025-05-07T19:55:47.9000005Z 2025-05-07T19:55:47.9001655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9003756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9004270Z ^ 2025-05-07T19:55:47.9004528Z 2025-05-07T19:55:47.9006143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9008034Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9008595Z ^ 2025-05-07T19:55:47.9008898Z 2025-05-07T19:55:47.9010677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9013480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9014646Z ^ 2025-05-07T19:55:47.9014930Z 2025-05-07T19:55:47.9015376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:47.9016037Z 2025-05-07T19:55:47.9017883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9020646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9022063Z ^ 2025-05-07T19:55:47.9022442Z 
2025-05-07T19:55:47.9024082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9026163Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9026759Z ^ 2025-05-07T19:55:47.9027077Z 2025-05-07T19:55:47.9029011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9031237Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9031822Z ^ 2025-05-07T19:55:47.9032154Z 2025-05-07T19:55:47.9033758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9035908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9036385Z ^ 2025-05-07T19:55:47.9036690Z 2025-05-07T19:55:47.9038221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9041215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9042425Z ^ 2025-05-07T19:55:47.9042663Z 2025-05-07T19:55:47.9043143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:47.9043803Z 2025-05-07T19:55:47.9045519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:55:47.9048360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9049563Z ^ 2025-05-07T19:55:47.9049920Z 2025-05-07T19:55:47.9051817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9053843Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9054451Z ^ 2025-05-07T19:55:47.9054768Z 2025-05-07T19:55:47.9056274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9058638Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9059231Z ^ 2025-05-07T19:55:47.9059554Z 2025-05-07T19:55:47.9061088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9063145Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9063751Z ^ 2025-05-07T19:55:47.9064104Z 2025-05-07T19:55:47.9065808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9068525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9069712Z ^ 2025-05-07T19:55:47.9069967Z 2025-05-07T19:55:47.9070443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:47.9071107Z 
2025-05-07T19:55:47.9072786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9075699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9077111Z ^ 2025-05-07T19:55:47.9077496Z 2025-05-07T19:55:47.9079130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9081176Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9081726Z ^ 2025-05-07T19:55:47.9082054Z 2025-05-07T19:55:47.9083680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9085775Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9086380Z ^ 2025-05-07T19:55:47.9086712Z 2025-05-07T19:55:47.9088334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9090451Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9091019Z ^ 2025-05-07T19:55:47.9091526Z 2025-05-07T19:55:47.9093201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9095942Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9097128Z ^ 2025-05-07T19:55:47.9097421Z 2025-05-07T19:55:47.9097875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:47.9098742Z 2025-05-07T19:55:47.9100460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.9103246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:47.9104435Z ^ 2025-05-07T19:55:47.9104807Z 2025-05-07T19:55:47.9106351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9108311Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9109058Z ^ 2025-05-07T19:55:47.9109349Z 2025-05-07T19:55:47.9111005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9113078Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9113714Z ^ 2025-05-07T19:55:47.9114032Z 2025-05-07T19:55:47.9115869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:47.9118022Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:47.9118606Z ^ 2025-05-07T19:55:47.9118945Z 2025-05-07T19:55:56.8670517Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:56.8695116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8697785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8699082Z ^ 2025-05-07T19:55:56.8699312Z 2025-05-07T19:55:56.8699735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.8700387Z 2025-05-07T19:55:56.8701987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8704709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8705765Z ^ 2025-05-07T19:55:56.8706112Z 2025-05-07T19:55:56.8707838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8710468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8711582Z ^ 2025-05-07T19:55:56.8711799Z 2025-05-07T19:55:56.8714106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.8714835Z 2025-05-07T19:55:56.8716377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8718955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8720137Z ^ 2025-05-07T19:55:56.8720503Z 2025-05-07T19:55:56.8722161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8724803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8726018Z ^ 2025-05-07T19:55:56.8726277Z 2025-05-07T19:55:56.8726728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.8727416Z 2025-05-07T19:55:56.8729331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8732013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8733151Z ^ 2025-05-07T19:55:56.8733490Z 2025-05-07T19:55:56.8735506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8738114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8739281Z ^ 2025-05-07T19:55:56.8739547Z 2025-05-07T19:55:56.8740199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.8740874Z 2025-05-07T19:55:56.8742575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8745259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8746471Z ^ 2025-05-07T19:55:56.8746846Z 2025-05-07T19:55:56.8748500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8751181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8752387Z ^ 2025-05-07T19:55:56.8752680Z 2025-05-07T19:55:56.8753137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.8753816Z 2025-05-07T19:55:56.8755645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.8758229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.8759679Z ^ 2025-05-07T19:55:56.8760059Z 2025-05-07T19:55:58.6425014Z 
[244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:58.6445734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.6448003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.6448999Z ^ 2025-05-07T19:55:58.6449216Z 2025-05-07T19:55:58.6449589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.6450127Z 2025-05-07T19:55:58.6451485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.6453654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.6454636Z ^ 2025-05-07T19:55:58.6454937Z 2025-05-07T19:55:58.6455995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6457865Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6458618Z ^ 
2025-05-07T19:55:58.6461331Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.6463826Z 2025-05-07T19:55:58.6464886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6466470Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6467234Z ^ 2025-05-07T19:55:58.6469941Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.6472423Z 2025-05-07T19:55:58.6473457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6475532Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6476309Z ^ 2025-05-07T19:55:58.6479008Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.6481711Z 2025-05-07T19:55:58.6482693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6484182Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6484999Z ^ 2025-05-07T19:55:58.6487919Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.6490839Z 2025-05-07T19:55:58.6492033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6493854Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6494716Z ^ 2025-05-07T19:55:58.6498045Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.6500957Z 2025-05-07T19:55:58.6502164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:58.6503906Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6504727Z ^ 2025-05-07T19:55:58.6507734Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.6510905Z 2025-05-07T19:55:58.6512162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6513983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6515003Z ^ 2025-05-07T19:55:58.6518567Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.6521208Z 2025-05-07T19:55:58.6522393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6524305Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6525211Z ^ 2025-05-07T19:55:58.6528858Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.6531971Z 2025-05-07T19:55:58.6533281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6535252Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6536160Z ^ 2025-05-07T19:55:58.6539544Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.6542856Z 2025-05-07T19:55:58.6544182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6546124Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6547012Z ^ 2025-05-07T19:55:58.6550036Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.6553166Z 2025-05-07T19:55:58.6554644Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6556622Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6557578Z ^ 2025-05-07T19:55:58.6561340Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.6564646Z 2025-05-07T19:55:58.6565966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6567953Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6568861Z ^ 2025-05-07T19:55:58.6572153Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.6575303Z 2025-05-07T19:55:58.6576561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6578552Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6579431Z ^ 2025-05-07T19:55:58.6582837Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.6586268Z 2025-05-07T19:55:58.6587603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6589601Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6590499Z ^ 2025-05-07T19:55:58.6593747Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.6597023Z 2025-05-07T19:55:58.6598365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6600385Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6601276Z ^ 2025-05-07T19:55:58.6604893Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.6607912Z 2025-05-07T19:55:58.6609218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6611321Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6612243Z ^ 2025-05-07T19:55:58.6615759Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.6618917Z 2025-05-07T19:55:58.6620243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6622222Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6623152Z ^ 2025-05-07T19:55:58.6626445Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.6629878Z 2025-05-07T19:55:58.6631160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:58.6633110Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6634036Z ^ 2025-05-07T19:55:58.6637618Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.6640713Z 2025-05-07T19:55:58.6642008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6644009Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6644905Z ^ 2025-05-07T19:55:58.6648497Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.6651660Z 2025-05-07T19:55:58.6652973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6655123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6655971Z ^ 2025-05-07T19:55:58.6659382Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.6662592Z 2025-05-07T19:55:58.6663908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6666204Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6667094Z ^ 2025-05-07T19:55:58.6670422Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.6674067Z 2025-05-07T19:55:58.6675557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6677629Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6678555Z ^ 2025-05-07T19:55:58.6682049Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.6685435Z 2025-05-07T19:55:58.6686792Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6688740Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6689633Z ^ 2025-05-07T19:55:58.6692966Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.6696024Z 2025-05-07T19:55:58.6697588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6699571Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6700542Z ^ 2025-05-07T19:55:58.6703857Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.6706900Z 2025-05-07T19:55:58.6708585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.6711362Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.6712629Z ^ 2025-05-07T19:55:58.6712913Z 2025-05-07T19:55:58.6713373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.6714074Z 2025-05-07T19:55:58.6715883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.6718623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.6719958Z ^ 2025-05-07T19:55:58.6720313Z 2025-05-07T19:55:58.6721600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6723515Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6724427Z ^ 2025-05-07T19:55:58.6727787Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.6731224Z 2025-05-07T19:55:58.6732563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6734508Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6735426Z ^ 2025-05-07T19:55:58.6739075Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.6742294Z 2025-05-07T19:55:58.6743616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6745698Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6746645Z ^ 2025-05-07T19:55:58.6749909Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.6752963Z 2025-05-07T19:55:58.6754346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6756281Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6757186Z ^ 2025-05-07T19:55:58.6760455Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, 
index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.6763664Z 2025-05-07T19:55:58.6764915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6766743Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6767685Z ^ 2025-05-07T19:55:58.6771030Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.6774215Z 2025-05-07T19:55:58.6775749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6777781Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6778702Z ^ 2025-05-07T19:55:58.6782025Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.6784952Z 2025-05-07T19:55:58.6786446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6788349Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6789255Z ^ 2025-05-07T19:55:58.6792505Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.6795907Z 2025-05-07T19:55:58.6797204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6799071Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6799936Z ^ 2025-05-07T19:55:58.6803272Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.6806410Z 2025-05-07T19:55:58.6807757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6809871Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6810785Z ^ 2025-05-07T19:55:58.6814371Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.6817525Z 2025-05-07T19:55:58.6818928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6821002Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6821943Z ^ 2025-05-07T19:55:58.6825535Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.6829159Z 2025-05-07T19:55:58.6830859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6832904Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6833952Z ^ 2025-05-07T19:55:58.6837718Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.6841103Z 2025-05-07T19:55:58.6842456Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6844754Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6845657Z ^ 2025-05-07T19:55:58.6848988Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.6852211Z 2025-05-07T19:55:58.6853512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6855686Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6856560Z ^ 2025-05-07T19:55:58.6859949Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.6863165Z 2025-05-07T19:55:58.6864543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6866659Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6867749Z ^ 
2025-05-07T19:55:58.6871140Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.6874647Z 2025-05-07T19:55:58.6875960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6878077Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6878939Z ^ 2025-05-07T19:55:58.6882444Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.6885791Z 2025-05-07T19:55:58.6887121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6889117Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6890057Z ^ 2025-05-07T19:55:58.6893547Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.6896852Z 2025-05-07T19:55:58.6898227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6900287Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6901356Z ^ 2025-05-07T19:55:58.6904979Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.6908277Z 2025-05-07T19:55:58.6909661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6911706Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6912633Z ^ 2025-05-07T19:55:58.6916254Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.6919667Z 2025-05-07T19:55:58.6921033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:55:58.6923042Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6925677Z ^ 2025-05-07T19:55:58.6929751Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.6933360Z 2025-05-07T19:55:58.6934743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6936833Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6937806Z ^ 2025-05-07T19:55:58.6941391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.6944648Z 2025-05-07T19:55:58.6945965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6947758Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6948627Z ^ 2025-05-07T19:55:58.6952378Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.6955874Z 2025-05-07T19:55:58.6957020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6959077Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6959979Z ^ 2025-05-07T19:55:58.6963469Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.6966734Z 2025-05-07T19:55:58.6968071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6970147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6971074Z ^ 2025-05-07T19:55:58.6974826Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 
2025-05-07T19:55:58.6978237Z 2025-05-07T19:55:58.6979530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.6981554Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.6982476Z ^ 2025-05-07T19:55:58.6985964Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.6989162Z 2025-05-07T19:55:58.6990864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.6993589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.6994948Z ^ 2025-05-07T19:55:58.6995211Z 2025-05-07T19:55:58.6995712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.6996529Z 2025-05-07T19:55:58.6998233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.7000907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.7002103Z ^ 2025-05-07T19:55:58.7002445Z 2025-05-07T19:55:58.7003703Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7005542Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7006421Z ^ 2025-05-07T19:55:58.7009893Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7013114Z 2025-05-07T19:55:58.7014638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7016891Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7018030Z ^ 2025-05-07T19:55:58.7021606Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7025015Z 2025-05-07T19:55:58.7026243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7028547Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7029636Z ^ 2025-05-07T19:55:58.7033016Z detected 
during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7036269Z 2025-05-07T19:55:58.7037737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7039806Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7040736Z ^ 2025-05-07T19:55:58.7044534Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7048139Z 2025-05-07T19:55:58.7049550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7051658Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7052628Z ^ 2025-05-07T19:55:58.7056312Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7059824Z 2025-05-07T19:55:58.7061228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7063337Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7064314Z ^ 2025-05-07T19:55:58.7068154Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7071587Z 2025-05-07T19:55:58.7072739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7074885Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7075757Z ^ 2025-05-07T19:55:58.7079341Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7082725Z 2025-05-07T19:55:58.7084100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:58.7086229Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7087196Z ^ 2025-05-07T19:55:58.7090432Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7093511Z 2025-05-07T19:55:58.7094654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7096743Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7097683Z ^ 2025-05-07T19:55:58.7101257Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7104591Z 2025-05-07T19:55:58.7105906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7108026Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7108998Z ^ 2025-05-07T19:55:58.7112814Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7116303Z 2025-05-07T19:55:58.7117672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7119702Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7120585Z ^ 2025-05-07T19:55:58.7124010Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7127414Z 2025-05-07T19:55:58.7128974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7130927Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7131829Z ^ 2025-05-07T19:55:58.7135160Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7138528Z 2025-05-07T19:55:58.7139852Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7141837Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7142716Z ^ 2025-05-07T19:55:58.7145993Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7149203Z 2025-05-07T19:55:58.7150603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7152800Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7153718Z ^ 2025-05-07T19:55:58.7157434Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7160646Z 2025-05-07T19:55:58.7161985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7164126Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7165019Z ^ 2025-05-07T19:55:58.7168354Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7171617Z 2025-05-07T19:55:58.7172860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7174801Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7175628Z ^ 2025-05-07T19:55:58.7178975Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7182345Z 2025-05-07T19:55:58.7183583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7185367Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7186200Z ^ 2025-05-07T19:55:58.7189371Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7192455Z 2025-05-07T19:55:58.7193669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7196045Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7196894Z ^ 2025-05-07T19:55:58.7200421Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7203491Z 2025-05-07T19:55:58.7204821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7206854Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7207637Z ^ 2025-05-07T19:55:58.7211150Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7214441Z 2025-05-07T19:55:58.7215815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion 
resulted in a change of sign 2025-05-07T19:55:58.7217879Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7218834Z ^ 2025-05-07T19:55:58.7222336Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7225585Z 2025-05-07T19:55:58.7226897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7229189Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7230297Z ^ 2025-05-07T19:55:58.7233882Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7237403Z 2025-05-07T19:55:58.7238750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7240672Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7241618Z ^ 2025-05-07T19:55:58.7245388Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const 
int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7248643Z 2025-05-07T19:55:58.7250223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7252276Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7253347Z ^ 2025-05-07T19:55:58.7256857Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7260014Z 2025-05-07T19:55:58.7261332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7263262Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7264141Z ^ 2025-05-07T19:55:58.7266986Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7269578Z 
2025-05-07T19:55:58.7271113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.7273907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.7275109Z ^ 2025-05-07T19:55:58.7275384Z 2025-05-07T19:55:58.7275836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7276476Z 2025-05-07T19:55:58.7278194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.7280826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.7281980Z ^ 2025-05-07T19:55:58.7282327Z 2025-05-07T19:55:58.7283701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7285803Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7286770Z ^ 2025-05-07T19:55:58.7290721Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7293993Z 2025-05-07T19:55:58.7295363Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7297563Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7298525Z ^ 2025-05-07T19:55:58.7302091Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7305418Z 2025-05-07T19:55:58.7306793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7308909Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7309881Z ^ 2025-05-07T19:55:58.7313449Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7316820Z 2025-05-07T19:55:58.7318158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7320228Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7321150Z ^ 2025-05-07T19:55:58.7324884Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7328170Z 2025-05-07T19:55:58.7329741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7331831Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7332755Z ^ 2025-05-07T19:55:58.7336112Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7339174Z 2025-05-07T19:55:58.7340670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7342804Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7343705Z ^ 2025-05-07T19:55:58.7346762Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7350105Z 2025-05-07T19:55:58.7351520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7353621Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7354721Z ^ 2025-05-07T19:55:58.7358334Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7361547Z 2025-05-07T19:55:58.7362909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7365147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7366089Z ^ 2025-05-07T19:55:58.7369600Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7372825Z 2025-05-07T19:55:58.7373986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:58.7376047Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7376984Z ^ 2025-05-07T19:55:58.7380484Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7383917Z 2025-05-07T19:55:58.7385445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7387550Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7388511Z ^ 2025-05-07T19:55:58.7392056Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7395588Z 2025-05-07T19:55:58.7396896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7398458Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7399198Z ^ 2025-05-07T19:55:58.7402354Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7405535Z 2025-05-07T19:55:58.7406832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7409123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7410063Z ^ 2025-05-07T19:55:58.7413527Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7416496Z 2025-05-07T19:55:58.7417670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7419737Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7420665Z ^ 2025-05-07T19:55:58.7424374Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7427682Z 2025-05-07T19:55:58.7429255Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7431668Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7432615Z ^ 2025-05-07T19:55:58.7436208Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7439599Z 2025-05-07T19:55:58.7440950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7443174Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7444131Z ^ 2025-05-07T19:55:58.7447632Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7450887Z 2025-05-07T19:55:58.7452238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7454155Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7455346Z ^ 
2025-05-07T19:55:58.7458318Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7461473Z 2025-05-07T19:55:58.7462826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7464859Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7465812Z ^ 2025-05-07T19:55:58.7469312Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7472537Z 2025-05-07T19:55:58.7473930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7476007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7477093Z ^ 2025-05-07T19:55:58.7480629Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7483924Z 2025-05-07T19:55:58.7485058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7486972Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7487909Z ^ 2025-05-07T19:55:58.7491476Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7494861Z 2025-05-07T19:55:58.7496167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7498152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7499038Z ^ 2025-05-07T19:55:58.7503765Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7507007Z 2025-05-07T19:55:58.7508291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): 
warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7510422Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7511353Z ^ 2025-05-07T19:55:58.7515194Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7518673Z 2025-05-07T19:55:58.7519943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7521872Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7522687Z ^ 2025-05-07T19:55:58.7526457Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7530005Z 2025-05-07T19:55:58.7531108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7533113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7534019Z ^ 2025-05-07T19:55:58.7537512Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t 
*, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7540687Z 2025-05-07T19:55:58.7542045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7544030Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7569086Z ^ 2025-05-07T19:55:58.7571948Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7574876Z 2025-05-07T19:55:58.7576156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:58.7578575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.7579612Z ^ 2025-05-07T19:55:58.7579888Z 2025-05-07T19:55:58.7580355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7580982Z 2025-05-07T19:55:58.7582589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:55:58.7584800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:58.7585822Z ^ 2025-05-07T19:55:58.7586141Z 2025-05-07T19:55:58.7587239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7588903Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7590021Z ^ 2025-05-07T19:55:58.7592839Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7595585Z 2025-05-07T19:55:58.7596612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7598207Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7598947Z ^ 2025-05-07T19:55:58.7601680Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7604231Z 2025-05-07T19:55:58.7605244Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7606844Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7607750Z ^ 2025-05-07T19:55:58.7610707Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7613286Z 2025-05-07T19:55:58.7614342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7615983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7616724Z ^ 2025-05-07T19:55:58.7619564Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7622184Z 2025-05-07T19:55:58.7623239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7624889Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7625669Z ^ 2025-05-07T19:55:58.7629140Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7631752Z 2025-05-07T19:55:58.7632864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7634410Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7635137Z ^ 2025-05-07T19:55:58.7638004Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7640830Z 2025-05-07T19:55:58.7642018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7643829Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7644709Z ^ 2025-05-07T19:55:58.7647917Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7650990Z 2025-05-07T19:55:58.7652125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7654298Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7655145Z ^ 2025-05-07T19:55:58.7658230Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7661507Z 2025-05-07T19:55:58.7662810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7664683Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7665582Z ^ 2025-05-07T19:55:58.7669142Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7671803Z 2025-05-07T19:55:58.7673083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:58.7675257Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7676185Z ^ 2025-05-07T19:55:58.7679815Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7682979Z 2025-05-07T19:55:58.7684276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7686270Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7687160Z ^ 2025-05-07T19:55:58.7690589Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7693883Z 2025-05-07T19:55:58.7695138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7697104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7697997Z ^ 2025-05-07T19:55:58.7701318Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7704619Z 2025-05-07T19:55:58.7705947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7708061Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7709046Z ^ 2025-05-07T19:55:58.7712858Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7716353Z 2025-05-07T19:55:58.7717728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7719704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7720649Z ^ 2025-05-07T19:55:58.7724063Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7727189Z 2025-05-07T19:55:58.7728764Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7730786Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7731733Z ^ 2025-05-07T19:55:58.7735158Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7738565Z 2025-05-07T19:55:58.7740033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7742041Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7743020Z ^ 2025-05-07T19:55:58.7746540Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7749869Z 2025-05-07T19:55:58.7751288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7753320Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7754523Z ^ 
2025-05-07T19:55:58.7758137Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7761359Z 2025-05-07T19:55:58.7762718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7765018Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7765948Z ^ 2025-05-07T19:55:58.7769361Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7772587Z 2025-05-07T19:55:58.7773851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7775802Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7776673Z ^ 2025-05-07T19:55:58.7780230Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7783769Z 2025-05-07T19:55:58.7785117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7787209Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7788104Z ^ 2025-05-07T19:55:58.7791748Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7794958Z 2025-05-07T19:55:58.7796298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7798201Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7799113Z ^ 2025-05-07T19:55:58.7802703Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7805806Z 2025-05-07T19:55:58.7807417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7809442Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7810500Z ^ 2025-05-07T19:55:58.7814038Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7817149Z 2025-05-07T19:55:58.7818460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7820490Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7821435Z ^ 2025-05-07T19:55:58.7824956Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7828109Z 2025-05-07T19:55:58.7829670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7831941Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7832823Z ^ 2025-05-07T19:55:58.7836276Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, 
const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7839266Z 2025-05-07T19:56:02.1868143Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:02.1891297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1893899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1895215Z ^ 2025-05-07T19:56:02.1895479Z 2025-05-07T19:56:02.1896322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.1897015Z 2025-05-07T19:56:02.1898743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1901739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1903001Z ^ 2025-05-07T19:56:02.1903375Z 2025-05-07T19:56:02.1905087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1907780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1909092Z ^ 2025-05-07T19:56:02.1909321Z 2025-05-07T19:56:02.1909754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.1910414Z 2025-05-07T19:56:02.1911987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1914752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1915983Z ^ 2025-05-07T19:56:02.1916366Z 2025-05-07T19:56:02.1918299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1921060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1922262Z ^ 2025-05-07T19:56:02.1922519Z 2025-05-07T19:56:02.1923079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.1923743Z 2025-05-07T19:56:02.1925448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1928072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1929573Z ^ 2025-05-07T19:56:02.1929940Z 2025-05-07T19:56:02.1931641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1934279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1935467Z ^ 2025-05-07T19:56:02.1935729Z 2025-05-07T19:56:02.1936213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.1936862Z 2025-05-07T19:56:02.1938323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1940559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1941568Z ^ 2025-05-07T19:56:02.1942163Z 2025-05-07T19:56:02.1943586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1945999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1947134Z ^ 2025-05-07T19:56:02.1947369Z 2025-05-07T19:56:02.1947754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.1948361Z 2025-05-07T19:56:02.1949957Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.1952499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.1953757Z ^ 2025-05-07T19:56:02.1954269Z 2025-05-07T19:56:02.4263624Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:56:02.4284897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4287358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4288236Z ^ 2025-05-07T19:56:02.4288435Z 2025-05-07T19:56:02.4288799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.4289305Z 2025-05-07T19:56:02.4290587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4292624Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4293460Z ^ 2025-05-07T19:56:02.4293714Z 2025-05-07T19:56:02.4295000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4297102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4298134Z ^ 2025-05-07T19:56:02.4298337Z 2025-05-07T19:56:02.4298697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.4299325Z 2025-05-07T19:56:02.4300764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4303150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4304110Z ^ 2025-05-07T19:56:02.4304646Z 2025-05-07T19:56:02.4306040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4308340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4309353Z ^ 2025-05-07T19:56:02.4309570Z 2025-05-07T19:56:02.4309990Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.4310538Z 2025-05-07T19:56:02.4311890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4314065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4315215Z ^ 2025-05-07T19:56:02.4315510Z 2025-05-07T19:56:02.4316829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4319014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4319964Z ^ 2025-05-07T19:56:02.4320207Z 2025-05-07T19:56:02.4320573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:02.4321111Z 2025-05-07T19:56:02.4322479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4324741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4325727Z ^ 2025-05-07T19:56:02.4326042Z 2025-05-07T19:56:02.4327365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4329757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4330738Z ^ 2025-05-07T19:56:02.4330949Z 2025-05-07T19:56:02.4331313Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:02.4331874Z 2025-05-07T19:56:02.4333211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:02.4335368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:02.4336493Z ^ 2025-05-07T19:56:02.4336814Z 2025-05-07T19:56:16.9657590Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:16.9677207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9679302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9680209Z ^ 2025-05-07T19:56:16.9680417Z 2025-05-07T19:56:16.9680796Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.9681310Z 2025-05-07T19:56:16.9682562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9684677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9685604Z ^ 2025-05-07T19:56:16.9685916Z 2025-05-07T19:56:16.9687171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9688750Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9689209Z ^ 2025-05-07T19:56:16.9689466Z 2025-05-07T19:56:16.9690696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9692459Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9692959Z ^ 2025-05-07T19:56:16.9693225Z 2025-05-07T19:56:16.9694443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9696154Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9696637Z ^ 2025-05-07T19:56:16.9696886Z 2025-05-07T19:56:16.9698101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9700250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9701227Z ^ 2025-05-07T19:56:16.9701439Z 2025-05-07T19:56:16.9701795Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.9702311Z 2025-05-07T19:56:16.9703828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9705829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9706881Z ^ 2025-05-07T19:56:16.9707147Z 2025-05-07T19:56:16.9708305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9709762Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9710341Z ^ 2025-05-07T19:56:16.9710579Z 2025-05-07T19:56:16.9711761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9713621Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9714208Z ^ 2025-05-07T19:56:16.9714463Z 2025-05-07T19:56:16.9715665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9717159Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9717576Z ^ 2025-05-07T19:56:16.9717802Z 2025-05-07T19:56:16.9719067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:16.9721062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9722010Z ^ 2025-05-07T19:56:16.9722202Z 2025-05-07T19:56:16.9722575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.9723072Z 2025-05-07T19:56:16.9724310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9726451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9727674Z ^ 2025-05-07T19:56:16.9727952Z 2025-05-07T19:56:16.9729588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9731208Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9731734Z ^ 2025-05-07T19:56:16.9731973Z 2025-05-07T19:56:16.9733151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9734620Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9735214Z ^ 2025-05-07T19:56:16.9735442Z 2025-05-07T19:56:16.9736625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9738126Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9738581Z ^ 2025-05-07T19:56:16.9738813Z 
2025-05-07T19:56:16.9740031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9741980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9742903Z ^ 2025-05-07T19:56:16.9743110Z 2025-05-07T19:56:16.9743475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.9743979Z 2025-05-07T19:56:16.9745225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9747451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9748357Z ^ 2025-05-07T19:56:16.9748683Z 2025-05-07T19:56:16.9749853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9751405Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9751880Z ^ 2025-05-07T19:56:16.9752166Z 2025-05-07T19:56:16.9753605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9755270Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9755726Z ^ 2025-05-07T19:56:16.9755959Z 2025-05-07T19:56:16.9757237Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9758832Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9759345Z ^ 2025-05-07T19:56:16.9759608Z 2025-05-07T19:56:16.9761015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9763815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9764938Z ^ 2025-05-07T19:56:16.9765159Z 2025-05-07T19:56:16.9765545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.9766330Z 2025-05-07T19:56:16.9768319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.9770641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.9771574Z ^ 2025-05-07T19:56:16.9771918Z 2025-05-07T19:56:16.9773123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9774755Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9775201Z ^ 2025-05-07T19:56:16.9775466Z 2025-05-07T19:56:16.9776677Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9778195Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9778648Z ^ 2025-05-07T19:56:16.9778877Z 2025-05-07T19:56:16.9780089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.9781637Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.9782262Z ^ 2025-05-07T19:56:16.9782498Z 2025-05-07T19:56:32.2727385Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:32.2751213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2753860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2754950Z ^ 2025-05-07T19:56:32.2755180Z 2025-05-07T19:56:32.2755607Z Remark: The warnings 
can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.2756224Z 2025-05-07T19:56:32.2757656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2760319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2761372Z ^ 2025-05-07T19:56:32.2761695Z 2025-05-07T19:56:32.2763264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2765722Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.2766695Z ^ 2025-05-07T19:56:32.2767000Z 2025-05-07T19:56:32.2768391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2770296Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2770781Z ^ 2025-05-07T19:56:32.2771092Z 2025-05-07T19:56:32.2772528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2774594Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2775106Z ^ 2025-05-07T19:56:32.2775358Z 2025-05-07T19:56:32.2776784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:32.2778641Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2779171Z ^ 2025-05-07T19:56:32.2779449Z 2025-05-07T19:56:32.2780909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2783569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2784761Z ^ 2025-05-07T19:56:32.2786748Z 2025-05-07T19:56:32.2787191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.2787806Z 2025-05-07T19:56:32.2789396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2791792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2792800Z ^ 2025-05-07T19:56:32.2793149Z 2025-05-07T19:56:32.2794610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2796442Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.2797143Z ^ 2025-05-07T19:56:32.2797416Z 2025-05-07T19:56:32.2798732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2800356Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2800863Z ^ 
2025-05-07T19:56:32.2801111Z 2025-05-07T19:56:32.2802667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2804586Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2805069Z ^ 2025-05-07T19:56:32.2805310Z 2025-05-07T19:56:32.2806820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2808787Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2809279Z ^ 2025-05-07T19:56:32.2809530Z 2025-05-07T19:56:32.2810853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2813398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2814469Z ^ 2025-05-07T19:56:32.2814737Z 2025-05-07T19:56:32.2815126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.2815749Z 2025-05-07T19:56:32.2817243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2819647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2820797Z ^ 2025-05-07T19:56:32.2821143Z 2025-05-07T19:56:32.2822770Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2824775Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.2825506Z ^ 2025-05-07T19:56:32.2825775Z 2025-05-07T19:56:32.2827629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2829542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2830050Z ^ 2025-05-07T19:56:32.2830491Z 2025-05-07T19:56:32.2831836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2833557Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2834231Z ^ 2025-05-07T19:56:32.2834518Z 2025-05-07T19:56:32.2835979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2837879Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2838373Z ^ 2025-05-07T19:56:32.2838664Z 2025-05-07T19:56:32.2840143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2842606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2843683Z ^ 
2025-05-07T19:56:32.2843925Z 2025-05-07T19:56:32.2844369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.2844978Z 2025-05-07T19:56:32.2846782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2849724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2850953Z ^ 2025-05-07T19:56:32.2851343Z 2025-05-07T19:56:32.2852890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2855059Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.2855828Z ^ 2025-05-07T19:56:32.2856121Z 2025-05-07T19:56:32.2857624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2859632Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2860129Z ^ 2025-05-07T19:56:32.2860400Z 2025-05-07T19:56:32.2861820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2863795Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2864352Z ^ 2025-05-07T19:56:32.2864662Z 2025-05-07T19:56:32.2866222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2868149Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2868641Z ^ 2025-05-07T19:56:32.2868908Z 2025-05-07T19:56:32.2870861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2873611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2875041Z ^ 2025-05-07T19:56:32.2875312Z 2025-05-07T19:56:32.2875818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.2876470Z 2025-05-07T19:56:32.2878198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:32.2880818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:32.2881964Z ^ 2025-05-07T19:56:32.2882314Z 2025-05-07T19:56:32.2883763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2886039Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.2886842Z ^ 2025-05-07T19:56:32.2887176Z 2025-05-07T19:56:32.2888794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2890805Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T19:56:32.2891390Z ^ 2025-05-07T19:56:32.2891694Z 2025-05-07T19:56:32.2893474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2895610Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2896209Z ^ 2025-05-07T19:56:32.2896487Z 2025-05-07T19:56:32.2898060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.2900003Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.2900585Z ^ 2025-05-07T19:56:32.2900860Z 2025-05-07T19:56:38.5257554Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:38.5281880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5284839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:56:38.5286133Z ^ 2025-05-07T19:56:38.5286372Z 2025-05-07T19:56:38.5286788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.5287823Z 2025-05-07T19:56:38.5289496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5292398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5293342Z ^ 2025-05-07T19:56:38.5293655Z 2025-05-07T19:56:38.5294886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5296665Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.5297484Z ^ 2025-05-07T19:56:38.5297762Z 2025-05-07T19:56:38.5299065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5300717Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5301220Z ^ 2025-05-07T19:56:38.5301470Z 2025-05-07T19:56:38.5302848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5304503Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5305046Z ^ 2025-05-07T19:56:38.5305279Z 2025-05-07T19:56:38.5306918Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5308649Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5309150Z ^ 2025-05-07T19:56:38.5309402Z 2025-05-07T19:56:38.5310858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5313412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5314685Z ^ 2025-05-07T19:56:38.5314920Z 2025-05-07T19:56:38.5315309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.5315856Z 2025-05-07T19:56:38.5317197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5319439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5320505Z ^ 2025-05-07T19:56:38.5320863Z 2025-05-07T19:56:38.5322209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5324125Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.5324844Z ^ 2025-05-07T19:56:38.5325103Z 2025-05-07T19:56:38.5326590Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5328818Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5329345Z ^ 2025-05-07T19:56:38.5329906Z 2025-05-07T19:56:38.5331271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5333071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5333571Z ^ 2025-05-07T19:56:38.5333799Z 2025-05-07T19:56:38.5335168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5336960Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5337464Z ^ 2025-05-07T19:56:38.5337754Z 2025-05-07T19:56:38.5339592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5342061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5343182Z ^ 2025-05-07T19:56:38.5343449Z 2025-05-07T19:56:38.5343900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.5344598Z 2025-05-07T19:56:38.5346215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5349035Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5350305Z ^ 2025-05-07T19:56:38.5350657Z 2025-05-07T19:56:38.5352183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5354688Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.5355401Z ^ 2025-05-07T19:56:38.5355694Z 2025-05-07T19:56:38.5357351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5359516Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5360059Z ^ 2025-05-07T19:56:38.5360335Z 2025-05-07T19:56:38.5361790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5363509Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5364012Z ^ 2025-05-07T19:56:38.5364308Z 2025-05-07T19:56:38.5365640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5367399Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5367931Z ^ 2025-05-07T19:56:38.5368215Z 2025-05-07T19:56:38.5369744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:38.5372613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5373782Z ^ 2025-05-07T19:56:38.5374037Z 2025-05-07T19:56:38.5374518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.5375194Z 2025-05-07T19:56:38.5376826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5379514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5380486Z ^ 2025-05-07T19:56:38.5380780Z 2025-05-07T19:56:38.5382152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5384094Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.5384877Z ^ 2025-05-07T19:56:38.5385171Z 2025-05-07T19:56:38.5386712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5388680Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5389249Z ^ 2025-05-07T19:56:38.5389555Z 2025-05-07T19:56:38.5391199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5393204Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5393765Z ^ 
2025-05-07T19:56:38.5394295Z 2025-05-07T19:56:38.5395918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5398064Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5398647Z ^ 2025-05-07T19:56:38.5398934Z 2025-05-07T19:56:38.5400587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5403243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5404573Z ^ 2025-05-07T19:56:38.5404815Z 2025-05-07T19:56:38.5405260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:38.5405849Z 2025-05-07T19:56:38.5407469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5410124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:38.5411190Z ^ 2025-05-07T19:56:38.5411561Z 2025-05-07T19:56:38.5413045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5415226Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:38.5415874Z ^ 2025-05-07T19:56:38.5416146Z 2025-05-07T19:56:38.5417487Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5419388Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5420195Z ^ 2025-05-07T19:56:38.5420489Z 2025-05-07T19:56:38.5422040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5424098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5424684Z ^ 2025-05-07T19:56:38.5424973Z 2025-05-07T19:56:38.5426585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:38.5428963Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:38.5429686Z ^ 2025-05-07T19:56:38.5429965Z 2025-05-07T19:56:39.4895939Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:39.4917990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.4920281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4921288Z ^ 2025-05-07T19:56:39.4921545Z 2025-05-07T19:56:39.4921935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.4922491Z 2025-05-07T19:56:39.4923927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.4925962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4926969Z ^ 2025-05-07T19:56:39.4927265Z 2025-05-07T19:56:39.4928825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4930528Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.4931153Z ^ 2025-05-07T19:56:39.4931412Z 2025-05-07T19:56:39.4932745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4934423Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4934961Z ^ 2025-05-07T19:56:39.4935582Z 2025-05-07T19:56:39.4936952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:56:39.4938531Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4939209Z ^ 2025-05-07T19:56:39.4939442Z 2025-05-07T19:56:39.4940746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4942314Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4942791Z ^ 2025-05-07T19:56:39.4943019Z 2025-05-07T19:56:39.4944396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.4946575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4947644Z ^ 2025-05-07T19:56:39.4947897Z 2025-05-07T19:56:39.4948321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.4948877Z 2025-05-07T19:56:39.4950218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.4952358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4953398Z ^ 2025-05-07T19:56:39.4953711Z 2025-05-07T19:56:39.4955431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4956978Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.4957594Z ^ 
2025-05-07T19:56:39.4957849Z 2025-05-07T19:56:39.4959123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4960752Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4961248Z ^ 2025-05-07T19:56:39.4961490Z 2025-05-07T19:56:39.4962793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4964703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4965150Z ^ 2025-05-07T19:56:39.4965422Z 2025-05-07T19:56:39.4966770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4968560Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4969067Z ^ 2025-05-07T19:56:39.4969347Z 2025-05-07T19:56:39.4970896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.4973645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4974737Z ^ 2025-05-07T19:56:39.4974972Z 2025-05-07T19:56:39.4975406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.4976010Z 2025-05-07T19:56:39.4977511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:56:39.4980182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.4981273Z ^ 2025-05-07T19:56:39.4981619Z 2025-05-07T19:56:39.4983220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4985148Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.4985833Z ^ 2025-05-07T19:56:39.4986094Z 2025-05-07T19:56:39.4987470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4989271Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4989755Z ^ 2025-05-07T19:56:39.4990033Z 2025-05-07T19:56:39.4991401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4993185Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4993688Z ^ 2025-05-07T19:56:39.4994270Z 2025-05-07T19:56:39.4995608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.4997344Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.4997840Z ^ 2025-05-07T19:56:39.4998093Z 2025-05-07T19:56:39.4999554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.5001968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.5003014Z ^ 2025-05-07T19:56:39.5003241Z 2025-05-07T19:56:39.5003689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.5004278Z 2025-05-07T19:56:39.5005738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.5008178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.5009242Z ^ 2025-05-07T19:56:39.5009596Z 2025-05-07T19:56:39.5010977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5012928Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.5013591Z ^ 2025-05-07T19:56:39.5014115Z 2025-05-07T19:56:39.5015438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5017248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5017726Z ^ 2025-05-07T19:56:39.5017983Z 2025-05-07T19:56:39.5019328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:39.5021016Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5021521Z ^ 2025-05-07T19:56:39.5021777Z 2025-05-07T19:56:39.5023156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5024850Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5025360Z ^ 2025-05-07T19:56:39.5025603Z 2025-05-07T19:56:39.5027014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.5029646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.5030676Z ^ 2025-05-07T19:56:39.5030918Z 2025-05-07T19:56:39.5031294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.5031951Z 2025-05-07T19:56:39.5033407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.5036192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.5037209Z ^ 2025-05-07T19:56:39.5037549Z 2025-05-07T19:56:39.5038882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5040796Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.5041391Z ^ 
2025-05-07T19:56:39.5041637Z 2025-05-07T19:56:39.5043077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5044756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5045255Z ^ 2025-05-07T19:56:39.5045499Z 2025-05-07T19:56:39.5046861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5048574Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5049088Z ^ 2025-05-07T19:56:39.5049361Z 2025-05-07T19:56:39.5050752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.5052418Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.5053317Z ^ 2025-05-07T19:56:39.5053587Z 2025-05-07T19:56:42.3244615Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:42.3266574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3269120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3270158Z ^ 2025-05-07T19:56:42.3270410Z 2025-05-07T19:56:42.3270790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.3271411Z 2025-05-07T19:56:42.3272919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3275403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3276320Z ^ 2025-05-07T19:56:42.3276600Z 2025-05-07T19:56:42.3278080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3279821Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.3280452Z ^ 2025-05-07T19:56:42.3280701Z 2025-05-07T19:56:42.3282033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3283677Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3284138Z ^ 2025-05-07T19:56:42.3284365Z 2025-05-07T19:56:42.3285598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:42.3287160Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3287629Z ^ 2025-05-07T19:56:42.3287868Z 2025-05-07T19:56:42.3289120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3290716Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3291183Z ^ 2025-05-07T19:56:42.3291437Z 2025-05-07T19:56:42.3292860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3295058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3296022Z ^ 2025-05-07T19:56:42.3296226Z 2025-05-07T19:56:42.3296643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.3297322Z 2025-05-07T19:56:42.3298744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3301002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3302004Z ^ 2025-05-07T19:56:42.3302308Z 2025-05-07T19:56:42.3303581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3305316Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.3305953Z ^ 
2025-05-07T19:56:42.3306201Z 2025-05-07T19:56:42.3307477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3309116Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3309584Z ^ 2025-05-07T19:56:42.3309824Z 2025-05-07T19:56:42.3311092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3312753Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3313201Z ^ 2025-05-07T19:56:42.3313458Z 2025-05-07T19:56:42.3315039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3316658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3317094Z ^ 2025-05-07T19:56:42.3317311Z 2025-05-07T19:56:42.3318772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3321089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3322095Z ^ 2025-05-07T19:56:42.3322307Z 2025-05-07T19:56:42.3322709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.3323260Z 2025-05-07T19:56:42.3324643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:56:42.3326939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3327907Z ^ 2025-05-07T19:56:42.3328238Z 2025-05-07T19:56:42.3329978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3331737Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.3332366Z ^ 2025-05-07T19:56:42.3332607Z 2025-05-07T19:56:42.3333874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3337471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3337915Z ^ 2025-05-07T19:56:42.3338152Z 2025-05-07T19:56:42.3339431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3341014Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3341466Z ^ 2025-05-07T19:56:42.3341696Z 2025-05-07T19:56:42.3342996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3344574Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3345053Z ^ 2025-05-07T19:56:42.3345297Z 2025-05-07T19:56:42.3346686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3348943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3349973Z ^ 2025-05-07T19:56:42.3350191Z 2025-05-07T19:56:42.3350577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.3351151Z 2025-05-07T19:56:42.3352583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3355185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3356188Z ^ 2025-05-07T19:56:42.3356504Z 2025-05-07T19:56:42.3357777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3359646Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.3360310Z ^ 2025-05-07T19:56:42.3360561Z 2025-05-07T19:56:42.3361897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3363501Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3363978Z ^ 2025-05-07T19:56:42.3364207Z 2025-05-07T19:56:42.3365473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3367095Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:56:42.3367554Z ^ 2025-05-07T19:56:42.3367776Z 2025-05-07T19:56:42.3369042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3370673Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3371149Z ^ 2025-05-07T19:56:42.3371386Z 2025-05-07T19:56:42.3372786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3375237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3376242Z ^ 2025-05-07T19:56:42.3376471Z 2025-05-07T19:56:42.3376858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.3377430Z 2025-05-07T19:56:42.3378875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3381155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.3382150Z ^ 2025-05-07T19:56:42.3382473Z 2025-05-07T19:56:42.3383798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3385595Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.3386292Z ^ 2025-05-07T19:56:42.3386541Z 2025-05-07T19:56:42.3387836Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3389436Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3389890Z ^ 2025-05-07T19:56:42.3390140Z 2025-05-07T19:56:42.3391633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3393257Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3393701Z ^ 2025-05-07T19:56:42.3394074Z 2025-05-07T19:56:42.3395366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.3397061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.3397526Z ^ 2025-05-07T19:56:42.3397760Z 2025-05-07T19:56:42.8486033Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:42.8513215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8516362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8517695Z ^ 2025-05-07T19:56:42.8517995Z 2025-05-07T19:56:42.8518521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.8519255Z 2025-05-07T19:56:42.8521396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8524370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8525821Z ^ 2025-05-07T19:56:42.8526228Z 2025-05-07T19:56:42.8527991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8531170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8532454Z ^ 2025-05-07T19:56:42.8532768Z 2025-05-07T19:56:42.8533259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.8534014Z 2025-05-07T19:56:42.8535928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8538850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8540155Z ^ 
2025-05-07T19:56:42.8540559Z 2025-05-07T19:56:42.8542364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8545240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8546802Z ^ 2025-05-07T19:56:42.8547089Z 2025-05-07T19:56:42.8547599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.8548246Z 2025-05-07T19:56:42.8549843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8552764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8554387Z ^ 2025-05-07T19:56:42.8554803Z 2025-05-07T19:56:42.8556622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8559580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8560893Z ^ 2025-05-07T19:56:42.8561224Z 2025-05-07T19:56:42.8561732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.8562478Z 2025-05-07T19:56:42.8564344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:42.8567519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8568849Z ^ 2025-05-07T19:56:42.8569246Z 2025-05-07T19:56:42.8572404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8575311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8576740Z ^ 2025-05-07T19:56:42.8577020Z 2025-05-07T19:56:42.8577510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.8578254Z 2025-05-07T19:56:42.8580034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.8582953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.8584254Z ^ 2025-05-07T19:56:42.8584695Z 2025-05-07T19:56:48.1216842Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:48.1243837Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1247070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1248817Z ^ 2025-05-07T19:56:48.1249121Z 2025-05-07T19:56:48.1249635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:48.1250359Z 2025-05-07T19:56:48.1252140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1255206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1256252Z ^ 2025-05-07T19:56:48.1256523Z 2025-05-07T19:56:48.1257970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1260889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1262213Z ^ 2025-05-07T19:56:48.1262505Z 2025-05-07T19:56:48.1262890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:48.1263598Z 2025-05-07T19:56:48.1265369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1268280Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1269464Z ^ 2025-05-07T19:56:48.1269866Z 2025-05-07T19:56:48.1271690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1274978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1276308Z ^ 2025-05-07T19:56:48.1276601Z 2025-05-07T19:56:48.1277121Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:48.1277846Z 2025-05-07T19:56:48.1279639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1282581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1283916Z ^ 2025-05-07T19:56:48.1284355Z 2025-05-07T19:56:48.1286122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1289038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1290344Z ^ 2025-05-07T19:56:48.1290668Z 2025-05-07T19:56:48.1291163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:48.1291676Z 2025-05-07T19:56:48.1293523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1296958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1298336Z ^ 2025-05-07T19:56:48.1298749Z 2025-05-07T19:56:48.1300594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1303719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1305040Z ^ 2025-05-07T19:56:48.1305327Z 2025-05-07T19:56:48.1305818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:48.1306574Z 2025-05-07T19:56:48.1308367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.1311140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:48.1312301Z ^ 2025-05-07T19:56:48.1312740Z 2025-05-07T19:56:50.4639830Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:50.4663379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4666202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4667508Z ^ 2025-05-07T19:56:50.4667770Z 2025-05-07T19:56:50.4668170Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.4668837Z 2025-05-07T19:56:50.4670406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4672936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4674215Z ^ 2025-05-07T19:56:50.4674597Z 2025-05-07T19:56:50.4676171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4678685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4679823Z ^ 2025-05-07T19:56:50.4680058Z 2025-05-07T19:56:50.4680504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.4681146Z 2025-05-07T19:56:50.4682767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4685481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4686861Z ^ 2025-05-07T19:56:50.4687263Z 2025-05-07T19:56:50.4689041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4691868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4692993Z ^ 2025-05-07T19:56:50.4693206Z 2025-05-07T19:56:50.4693610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.4694237Z 2025-05-07T19:56:50.4695812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4698393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4699498Z ^ 2025-05-07T19:56:50.4699873Z 2025-05-07T19:56:50.4701443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4704009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4705146Z ^ 2025-05-07T19:56:50.4705405Z 2025-05-07T19:56:50.4705884Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:50.4706503Z 2025-05-07T19:56:50.4708327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4710991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4712275Z ^ 2025-05-07T19:56:50.4712651Z 2025-05-07T19:56:50.4714362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4716945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4718106Z ^ 2025-05-07T19:56:50.4718387Z 2025-05-07T19:56:50.4718831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.4719458Z 2025-05-07T19:56:50.4721078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.4723644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.4724791Z ^ 2025-05-07T19:56:50.4725133Z 2025-05-07T19:56:51.6732402Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:51.6770395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6773394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6774448Z ^ 2025-05-07T19:56:51.6774688Z 2025-05-07T19:56:51.6775120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.6775755Z 2025-05-07T19:56:51.6777290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6779701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6780801Z ^ 2025-05-07T19:56:51.6781137Z 2025-05-07T19:56:51.6782655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6785150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6786242Z ^ 2025-05-07T19:56:51.6786475Z 2025-05-07T19:56:51.6786873Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:56:51.6787489Z 2025-05-07T19:56:51.6789213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6791692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6792818Z ^ 2025-05-07T19:56:51.6793151Z 2025-05-07T19:56:51.6794956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6797476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6798623Z ^ 2025-05-07T19:56:51.6798863Z 2025-05-07T19:56:51.6799513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.6800048Z 2025-05-07T19:56:51.6801522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6804261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6805539Z ^ 2025-05-07T19:56:51.6805926Z 2025-05-07T19:56:51.6807417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6810069Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6811168Z ^ 2025-05-07T19:56:51.6811426Z 2025-05-07T19:56:51.6811847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.6812459Z 2025-05-07T19:56:51.6814080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6816457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6817628Z ^ 2025-05-07T19:56:51.6817978Z 2025-05-07T19:56:51.6819544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6821969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6823101Z ^ 2025-05-07T19:56:51.6823336Z 2025-05-07T19:56:51.6823737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.6824309Z 2025-05-07T19:56:51.6826055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.6828773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.6829890Z ^ 2025-05-07T19:56:51.6830271Z 2025-05-07T19:56:51.9220318Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:51.9243193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9245798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9246975Z ^ 2025-05-07T19:56:51.9247252Z 2025-05-07T19:56:51.9247744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.9248416Z 2025-05-07T19:56:51.9250057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9252433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9253567Z ^ 2025-05-07T19:56:51.9253898Z 2025-05-07T19:56:51.9255352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9257688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9258988Z ^ 
2025-05-07T19:56:51.9259219Z 2025-05-07T19:56:51.9259670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.9260308Z 2025-05-07T19:56:51.9261924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9264398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9265489Z ^ 2025-05-07T19:56:51.9265835Z 2025-05-07T19:56:51.9267377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9269838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9270941Z ^ 2025-05-07T19:56:51.9271212Z 2025-05-07T19:56:51.9271626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.9272241Z 2025-05-07T19:56:51.9273780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9276576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9277733Z ^ 2025-05-07T19:56:51.9278086Z 2025-05-07T19:56:51.9280003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:51.9282505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9283752Z ^ 2025-05-07T19:56:51.9284002Z 2025-05-07T19:56:51.9284468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.9285106Z 2025-05-07T19:56:51.9286684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9289245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9290430Z ^ 2025-05-07T19:56:51.9290811Z 2025-05-07T19:56:51.9292431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9294949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9296074Z ^ 2025-05-07T19:56:51.9296355Z 2025-05-07T19:56:51.9296799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:51.9297458Z 2025-05-07T19:56:51.9299075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.9301613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:51.9303077Z ^ 2025-05-07T19:56:51.9303435Z 2025-05-07T19:56:55.2905269Z [257/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:55.2925796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2928156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2929410Z ^ 2025-05-07T19:56:55.2929631Z 2025-05-07T19:56:55.2930035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.2930602Z 2025-05-07T19:56:55.2931964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2934195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2935189Z ^ 2025-05-07T19:56:55.2935530Z 2025-05-07T19:56:55.2936913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2939331Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2940325Z ^ 2025-05-07T19:56:55.2940577Z 2025-05-07T19:56:55.2940969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.2941541Z 2025-05-07T19:56:55.2942924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2945002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2946248Z ^ 2025-05-07T19:56:55.2946609Z 2025-05-07T19:56:55.2948106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2950664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2951768Z ^ 2025-05-07T19:56:55.2952022Z 2025-05-07T19:56:55.2952438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.2953083Z 2025-05-07T19:56:55.2954713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2957637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2958728Z ^ 2025-05-07T19:56:55.2959099Z 2025-05-07T19:56:55.2960523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2962977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2964050Z ^ 2025-05-07T19:56:55.2964324Z 2025-05-07T19:56:55.2964743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.2965337Z 2025-05-07T19:56:55.2966810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2969171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2970288Z ^ 2025-05-07T19:56:55.2970627Z 2025-05-07T19:56:55.2972060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2974397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.2975624Z ^ 2025-05-07T19:56:55.2975872Z 2025-05-07T19:56:55.2976293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.2976942Z 2025-05-07T19:56:55.2978543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.2981153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:56:55.2982276Z ^ 2025-05-07T19:56:55.2982627Z 2025-05-07T19:57:03.3715909Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:03.3740378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3743211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3744385Z ^ 2025-05-07T19:57:03.3744659Z 2025-05-07T19:57:03.3745114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3745793Z 2025-05-07T19:57:03.3747517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3750249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3751737Z ^ 2025-05-07T19:57:03.3752101Z 2025-05-07T19:57:03.3754006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:57:03.3756682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3757794Z ^ 2025-05-07T19:57:03.3758037Z 2025-05-07T19:57:03.3758508Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3759140Z 2025-05-07T19:57:03.3760444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3762776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3763929Z ^ 2025-05-07T19:57:03.3764304Z 2025-05-07T19:57:03.3765638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3768171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3769249Z ^ 2025-05-07T19:57:03.3769532Z 2025-05-07T19:57:03.3769955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3770623Z 2025-05-07T19:57:03.3772597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3775181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3776463Z ^ 2025-05-07T19:57:03.3776814Z 2025-05-07T19:57:03.3778374Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3780969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3782187Z ^ 2025-05-07T19:57:03.3782460Z 2025-05-07T19:57:03.3782914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3783593Z 2025-05-07T19:57:03.3785149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3787642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3788782Z ^ 2025-05-07T19:57:03.3789152Z 2025-05-07T19:57:03.3790803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3793412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3794956Z ^ 2025-05-07T19:57:03.3795241Z 2025-05-07T19:57:03.3795694Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.3796211Z 2025-05-07T19:57:03.3797839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.3800592Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.3801823Z ^ 2025-05-07T19:57:03.3802193Z 2025-05-07T19:57:06.6217626Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:06.6241096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6243777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6244896Z ^ 2025-05-07T19:57:06.6245157Z 2025-05-07T19:57:06.6245616Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6246647Z 2025-05-07T19:57:06.6248195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6250826Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6251987Z ^ 2025-05-07T19:57:06.6252376Z 2025-05-07T19:57:06.6254025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6256460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6257594Z ^ 2025-05-07T19:57:06.6257857Z 2025-05-07T19:57:06.6258298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6258980Z 2025-05-07T19:57:06.6260444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6262854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6264010Z ^ 2025-05-07T19:57:06.6264386Z 2025-05-07T19:57:06.6266194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6268924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6270142Z ^ 2025-05-07T19:57:06.6270410Z 2025-05-07T19:57:06.6270891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6273619Z 2025-05-07T19:57:06.6275386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6278059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6279230Z ^ 2025-05-07T19:57:06.6279599Z 2025-05-07T19:57:06.6281281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6283740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6284894Z ^ 2025-05-07T19:57:06.6285192Z 2025-05-07T19:57:06.6285598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6286194Z 2025-05-07T19:57:06.6287599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6290059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6291139Z ^ 2025-05-07T19:57:06.6291480Z 2025-05-07T19:57:06.6293234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6295586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6296664Z ^ 2025-05-07T19:57:06.6296900Z 2025-05-07T19:57:06.6297302Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:06.6297949Z 2025-05-07T19:57:06.6299499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6302143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6303357Z ^ 2025-05-07T19:57:06.6303749Z 2025-05-07T19:57:11.5870931Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:11.5894644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5897852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5899045Z ^ 2025-05-07T19:57:11.5899329Z 2025-05-07T19:57:11.5899759Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:11.5900389Z 2025-05-07T19:57:11.5901940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5904565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5905737Z ^ 2025-05-07T19:57:11.5906098Z 2025-05-07T19:57:11.5907744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5910233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5911423Z ^ 2025-05-07T19:57:11.5911680Z 2025-05-07T19:57:11.5912144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:11.5912850Z 2025-05-07T19:57:11.5914661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5917675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5918866Z ^ 2025-05-07T19:57:11.5919461Z 2025-05-07T19:57:11.5921094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5923899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5924968Z ^ 2025-05-07T19:57:11.5925229Z 2025-05-07T19:57:11.5925703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:11.5926378Z 2025-05-07T19:57:11.5928047Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5931088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5932272Z ^ 2025-05-07T19:57:11.5932647Z 2025-05-07T19:57:11.5934191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5936739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5937803Z ^ 2025-05-07T19:57:11.5938079Z 2025-05-07T19:57:11.5938482Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:11.5939116Z 2025-05-07T19:57:11.5940798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5943640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5944945Z ^ 2025-05-07T19:57:11.5945317Z 2025-05-07T19:57:11.5947016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5949847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5951065Z ^ 
2025-05-07T19:57:11.5951318Z 2025-05-07T19:57:11.5951800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:11.5952466Z 2025-05-07T19:57:11.5954275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:11.5957071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:11.5958240Z ^ 2025-05-07T19:57:11.5958642Z 2025-05-07T19:57:12.3293749Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:12.3317793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3320524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3321720Z ^ 2025-05-07T19:57:12.3321966Z 2025-05-07T19:57:12.3322423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3323133Z 2025-05-07T19:57:12.3324793Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3327500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3328955Z ^ 2025-05-07T19:57:12.3329327Z 2025-05-07T19:57:12.3331023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3333680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3335058Z ^ 2025-05-07T19:57:12.3335326Z 2025-05-07T19:57:12.3335807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3336478Z 2025-05-07T19:57:12.3338535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3341307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3342607Z ^ 2025-05-07T19:57:12.3342977Z 2025-05-07T19:57:12.3344676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3347414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3348776Z ^ 
2025-05-07T19:57:12.3349050Z 2025-05-07T19:57:12.3349506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3350191Z 2025-05-07T19:57:12.3351908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3354767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3356011Z ^ 2025-05-07T19:57:12.3356360Z 2025-05-07T19:57:12.3357932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3360585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3361793Z ^ 2025-05-07T19:57:12.3362213Z 2025-05-07T19:57:12.3362655Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3363364Z 2025-05-07T19:57:12.3365075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3367836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3368975Z ^ 2025-05-07T19:57:12.3369366Z 2025-05-07T19:57:12.3371023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:12.3373738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3374935Z ^ 2025-05-07T19:57:12.3375212Z 2025-05-07T19:57:12.3375674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.3376354Z 2025-05-07T19:57:12.3378077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.3380496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.3381599Z ^ 2025-05-07T19:57:12.3381961Z 2025-05-07T19:57:12.5863737Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:12.5881175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5883140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5883968Z ^ 2025-05-07T19:57:12.5884194Z 
2025-05-07T19:57:12.5884526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.5885062Z 2025-05-07T19:57:12.5886265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5888155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5889038Z ^ 2025-05-07T19:57:12.5889318Z 2025-05-07T19:57:12.5890483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5892386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5893551Z ^ 2025-05-07T19:57:12.5893755Z 2025-05-07T19:57:12.5894087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.5894597Z 2025-05-07T19:57:12.5895761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5897764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5898713Z ^ 2025-05-07T19:57:12.5899014Z 2025-05-07T19:57:12.5900158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5902246Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5903094Z ^ 2025-05-07T19:57:12.5903310Z 2025-05-07T19:57:12.5903631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.5904113Z 2025-05-07T19:57:12.5905273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5907161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5908113Z ^ 2025-05-07T19:57:12.5908372Z 2025-05-07T19:57:12.5909486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5911525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5912375Z ^ 2025-05-07T19:57:12.5912565Z 2025-05-07T19:57:12.5912878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.5913369Z 2025-05-07T19:57:12.5914647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5916500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5917326Z ^ 2025-05-07T19:57:12.5917608Z 2025-05-07T19:57:12.5918744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5920577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5921396Z ^ 2025-05-07T19:57:12.5921577Z 2025-05-07T19:57:12.5921914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.5922370Z 2025-05-07T19:57:12.5923499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.5925536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.5926396Z ^ 2025-05-07T19:57:12.5926650Z 2025-05-07T19:57:12.8380415Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:12.8402640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:12.8405201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8406302Z ^ 2025-05-07T19:57:12.8406551Z 2025-05-07T19:57:12.8406952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.8407662Z 2025-05-07T19:57:12.8409349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8411792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8412884Z ^ 2025-05-07T19:57:12.8413214Z 2025-05-07T19:57:12.8415028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8417426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8418679Z ^ 2025-05-07T19:57:12.8418926Z 2025-05-07T19:57:12.8419376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.8419990Z 2025-05-07T19:57:12.8421558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8423841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8424923Z ^ 2025-05-07T19:57:12.8425351Z 2025-05-07T19:57:12.8426873Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8429682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8430801Z ^ 2025-05-07T19:57:12.8431093Z 2025-05-07T19:57:12.8431524Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.8432133Z 2025-05-07T19:57:12.8433690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8436375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8437768Z ^ 2025-05-07T19:57:12.8438199Z 2025-05-07T19:57:12.8439667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8442370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8443475Z ^ 2025-05-07T19:57:12.8443718Z 2025-05-07T19:57:12.8444125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.8444758Z 2025-05-07T19:57:12.8446147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8448429Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8449512Z ^ 2025-05-07T19:57:12.8449897Z 2025-05-07T19:57:12.8451396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8453783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8454803Z ^ 2025-05-07T19:57:12.8455080Z 2025-05-07T19:57:12.8455488Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:12.8456120Z 2025-05-07T19:57:12.8458020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.8460657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:12.8462081Z ^ 2025-05-07T19:57:12.8462444Z 2025-05-07T19:57:15.7927788Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:15.7951989Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7954627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7955808Z ^ 2025-05-07T19:57:15.7956074Z 2025-05-07T19:57:15.7956562Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7957177Z 2025-05-07T19:57:15.7958749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7961676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7962816Z ^ 2025-05-07T19:57:15.7963172Z 2025-05-07T19:57:15.7964882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7967646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7968842Z ^ 2025-05-07T19:57:15.7969085Z 2025-05-07T19:57:15.7969550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7970230Z 2025-05-07T19:57:15.7971955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7974641Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7975896Z ^ 2025-05-07T19:57:15.7976277Z 2025-05-07T19:57:15.7977907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7980607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7981845Z ^ 2025-05-07T19:57:15.7982071Z 2025-05-07T19:57:15.7982552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7983438Z 2025-05-07T19:57:15.7985018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7987505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7988651Z ^ 2025-05-07T19:57:15.7988999Z 2025-05-07T19:57:15.7990568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.7993202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.7994549Z ^ 2025-05-07T19:57:15.7994844Z 2025-05-07T19:57:15.7995286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.7995946Z 2025-05-07T19:57:15.7997614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.8000287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.8001491Z ^ 2025-05-07T19:57:15.8001856Z 2025-05-07T19:57:15.8003450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.8006078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.8007262Z ^ 2025-05-07T19:57:15.8007517Z 2025-05-07T19:57:15.8007928Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:15.8008624Z 2025-05-07T19:57:15.8010264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.8012835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:15.8013871Z ^ 2025-05-07T19:57:15.8014223Z 2025-05-07T19:57:27.9350498Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:27.9373486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9376078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9377238Z ^ 2025-05-07T19:57:27.9377489Z 2025-05-07T19:57:27.9378422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:27.9379104Z 2025-05-07T19:57:27.9380669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9383566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9384761Z ^ 2025-05-07T19:57:27.9385088Z 2025-05-07T19:57:27.9386700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9389274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9390483Z ^ 2025-05-07T19:57:27.9390743Z 2025-05-07T19:57:27.9391173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:27.9391778Z 2025-05-07T19:57:27.9393201Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9395887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9397090Z ^ 2025-05-07T19:57:27.9397443Z 2025-05-07T19:57:27.9398897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9401648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9402710Z ^ 2025-05-07T19:57:27.9402977Z 2025-05-07T19:57:27.9403381Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:27.9404004Z 2025-05-07T19:57:27.9405833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9408422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9409557Z ^ 2025-05-07T19:57:27.9409881Z 2025-05-07T19:57:27.9411349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9413813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9415075Z ^ 
2025-05-07T19:57:27.9415320Z 2025-05-07T19:57:27.9415721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:27.9416372Z 2025-05-07T19:57:27.9418064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9420485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9421579Z ^ 2025-05-07T19:57:27.9422226Z 2025-05-07T19:57:27.9423732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9426355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9427395Z ^ 2025-05-07T19:57:27.9427651Z 2025-05-07T19:57:27.9428062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:27.9428924Z 2025-05-07T19:57:27.9430381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.9432844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:27.9434114Z ^ 2025-05-07T19:57:27.9434567Z 2025-05-07T19:57:29.7053085Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:29.7075713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7078738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7079851Z ^ 2025-05-07T19:57:29.7080188Z 2025-05-07T19:57:29.7080599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.7081403Z 2025-05-07T19:57:29.7082992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7085704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7086864Z ^ 2025-05-07T19:57:29.7087223Z 2025-05-07T19:57:29.7088806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7091282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:29.7092260Z ^ 2025-05-07T19:57:29.7092487Z 2025-05-07T19:57:29.7092897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.7093477Z 2025-05-07T19:57:29.7095064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7097541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7098663Z ^ 2025-05-07T19:57:29.7099260Z 2025-05-07T19:57:29.7100809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7103408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7104610Z ^ 2025-05-07T19:57:29.7104885Z 2025-05-07T19:57:29.7105278Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.7105897Z 2025-05-07T19:57:29.7107430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7110040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7111223Z ^ 2025-05-07T19:57:29.7111587Z 2025-05-07T19:57:29.7113244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:57:29.7116071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7117132Z ^ 2025-05-07T19:57:29.7117376Z 2025-05-07T19:57:29.7117861Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.7118436Z 2025-05-07T19:57:29.7120226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7122855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7124016Z ^ 2025-05-07T19:57:29.7124503Z 2025-05-07T19:57:29.7126100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7129018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7130358Z ^ 2025-05-07T19:57:29.7130622Z 2025-05-07T19:57:29.7131069Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.7131752Z 2025-05-07T19:57:29.7133497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7136247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.7137465Z ^ 2025-05-07T19:57:29.7137793Z 2025-05-07T19:57:39.6213216Z [267/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:39.6235918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6239246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6240473Z ^ 2025-05-07T19:57:39.6240739Z 2025-05-07T19:57:39.6241195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6242063Z 2025-05-07T19:57:39.6243723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6246479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6247693Z ^ 2025-05-07T19:57:39.6248091Z 2025-05-07T19:57:39.6249750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6252127Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6253279Z ^ 2025-05-07T19:57:39.6253551Z 2025-05-07T19:57:39.6254000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6254561Z 2025-05-07T19:57:39.6256456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6259307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6260523Z ^ 2025-05-07T19:57:39.6260911Z 2025-05-07T19:57:39.6262556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6265009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6266169Z ^ 2025-05-07T19:57:39.6266436Z 2025-05-07T19:57:39.6266839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6267480Z 2025-05-07T19:57:39.6269260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6271774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6272910Z ^ 2025-05-07T19:57:39.6273287Z 2025-05-07T19:57:39.6274889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6277278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6278315Z ^ 2025-05-07T19:57:39.6278553Z 2025-05-07T19:57:39.6279308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6279895Z 2025-05-07T19:57:39.6281383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6283898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6285005Z ^ 2025-05-07T19:57:39.6285346Z 2025-05-07T19:57:39.6286894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6289381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.6290485Z ^ 2025-05-07T19:57:39.6290726Z 2025-05-07T19:57:39.6291138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6291779Z 2025-05-07T19:57:39.6293282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.6295972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:57:39.6297030Z ^ 2025-05-07T19:57:39.6297369Z 2025-05-07T19:57:41.1532886Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:41.1556129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1559032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1560287Z ^ 2025-05-07T19:57:41.1560558Z 2025-05-07T19:57:41.1561033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.1561723Z 2025-05-07T19:57:41.1563467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1566258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1567486Z ^ 2025-05-07T19:57:41.1567879Z 2025-05-07T19:57:41.1569497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:57:41.1572606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1573840Z ^ 2025-05-07T19:57:41.1574394Z 2025-05-07T19:57:41.1574885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.1575592Z 2025-05-07T19:57:41.1577389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1580202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1581477Z ^ 2025-05-07T19:57:41.1581854Z 2025-05-07T19:57:41.1583362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1585232Z int error_code = 0; 2025-05-07T19:57:41.1585867Z ^ 2025-05-07T19:57:41.1586092Z 2025-05-07T19:57:41.1587584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1589422Z int64_t error_value; 2025-05-07T19:57:41.1589912Z ^ 2025-05-07T19:57:41.1590150Z 2025-05-07T19:57:41.1591589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1593437Z int error_code = 0; 2025-05-07T19:57:41.1594058Z ^ 2025-05-07T19:57:41.1594279Z 2025-05-07T19:57:41.1595726Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1597574Z int64_t error_value; 2025-05-07T19:57:41.1598388Z ^ 2025-05-07T19:57:41.1598662Z 2025-05-07T19:57:41.1600306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1602254Z int error_code = 0; 2025-05-07T19:57:41.1602715Z ^ 2025-05-07T19:57:41.1602943Z 2025-05-07T19:57:41.1604456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1606348Z int64_t error_value; 2025-05-07T19:57:41.1606846Z ^ 2025-05-07T19:57:41.1607101Z 2025-05-07T19:57:41.1608619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1610503Z int error_code = 0; 2025-05-07T19:57:41.1610996Z ^ 2025-05-07T19:57:41.1611219Z 2025-05-07T19:57:41.1612708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1614569Z int64_t error_value; 2025-05-07T19:57:41.1614942Z ^ 2025-05-07T19:57:41.1615180Z 2025-05-07T19:57:41.1616900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1619656Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1620715Z ^ 2025-05-07T19:57:41.1621010Z 2025-05-07T19:57:41.1621593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.1622259Z 2025-05-07T19:57:41.1623963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1626568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1627786Z ^ 2025-05-07T19:57:41.1628119Z 2025-05-07T19:57:41.1629620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1631282Z int error_code = 0; 2025-05-07T19:57:41.1631726Z ^ 2025-05-07T19:57:41.1631927Z 2025-05-07T19:57:41.1633255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1635002Z int64_t error_value; 2025-05-07T19:57:41.1635466Z ^ 2025-05-07T19:57:41.1635712Z 2025-05-07T19:57:41.1637379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1638979Z int error_code = 0; 2025-05-07T19:57:41.1639410Z ^ 2025-05-07T19:57:41.1639653Z 2025-05-07T19:57:41.1641045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning 
#177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1642839Z int64_t error_value; 2025-05-07T19:57:41.1643638Z ^ 2025-05-07T19:57:41.1643855Z 2025-05-07T19:57:41.1645129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1646751Z int error_code = 0; 2025-05-07T19:57:41.1647193Z ^ 2025-05-07T19:57:41.1647391Z 2025-05-07T19:57:41.1648532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1650140Z int64_t error_value; 2025-05-07T19:57:41.1650573Z ^ 2025-05-07T19:57:41.1650790Z 2025-05-07T19:57:41.1652110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1653781Z int error_code = 0; 2025-05-07T19:57:41.1654172Z ^ 2025-05-07T19:57:41.1654375Z 2025-05-07T19:57:41.1655739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1657529Z int64_t error_value; 2025-05-07T19:57:41.1657990Z ^ 2025-05-07T19:57:41.1658233Z 2025-05-07T19:57:41.1659922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1662624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:41.1663873Z ^ 2025-05-07T19:57:41.1664115Z 2025-05-07T19:57:41.1664842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.1665556Z 2025-05-07T19:57:41.1667208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1670022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1671173Z ^ 2025-05-07T19:57:41.1671519Z 2025-05-07T19:57:41.1672882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1674822Z int error_code = 0; 2025-05-07T19:57:41.1675240Z ^ 2025-05-07T19:57:41.1675477Z 2025-05-07T19:57:41.1676857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1678604Z int64_t error_value; 2025-05-07T19:57:41.1679041Z ^ 2025-05-07T19:57:41.1679281Z 2025-05-07T19:57:41.1680641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1682315Z int error_code = 0; 2025-05-07T19:57:41.1682764Z ^ 2025-05-07T19:57:41.1682970Z 2025-05-07T19:57:41.1684376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1686133Z int64_t error_value; 
2025-05-07T19:57:41.1686837Z ^ 2025-05-07T19:57:41.1687076Z 2025-05-07T19:57:41.1688454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1690298Z int error_code = 0; 2025-05-07T19:57:41.1690768Z ^ 2025-05-07T19:57:41.1690981Z 2025-05-07T19:57:41.1692300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1694053Z int64_t error_value; 2025-05-07T19:57:41.1694488Z ^ 2025-05-07T19:57:41.1694740Z 2025-05-07T19:57:41.1696104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1697872Z int error_code = 0; 2025-05-07T19:57:41.1698305Z ^ 2025-05-07T19:57:41.1698516Z 2025-05-07T19:57:41.1699912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1701857Z int64_t error_value; 2025-05-07T19:57:41.1702333Z ^ 2025-05-07T19:57:41.1702568Z 2025-05-07T19:57:41.1704228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1706860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1708050Z ^ 2025-05-07T19:57:41.1708307Z 2025-05-07T19:57:41.1708765Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:57:41.1709564Z 2025-05-07T19:57:41.1711214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.1714035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.1715205Z ^ 2025-05-07T19:57:41.1715594Z 2025-05-07T19:57:41.1716931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1718668Z int error_code = 0; 2025-05-07T19:57:41.1719104Z ^ 2025-05-07T19:57:41.1719323Z 2025-05-07T19:57:41.1720732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1722441Z int64_t error_value; 2025-05-07T19:57:41.1722926Z ^ 2025-05-07T19:57:41.1723183Z 2025-05-07T19:57:41.1724660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1726455Z int error_code = 0; 2025-05-07T19:57:41.1726922Z ^ 2025-05-07T19:57:41.1727131Z 2025-05-07T19:57:41.1728756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1730216Z int64_t error_value; 2025-05-07T19:57:41.1730668Z ^ 2025-05-07T19:57:41.1731165Z 2025-05-07T19:57:41.1732464Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1734128Z int error_code = 0; 2025-05-07T19:57:41.1734659Z ^ 2025-05-07T19:57:41.1734872Z 2025-05-07T19:57:41.1736264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1738083Z int64_t error_value; 2025-05-07T19:57:41.1738551Z ^ 2025-05-07T19:57:41.1738799Z 2025-05-07T19:57:41.1740126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.1741867Z int error_code = 0; 2025-05-07T19:57:41.1742350Z ^ 2025-05-07T19:57:41.1742565Z 2025-05-07T19:57:41.1743995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.1745772Z int64_t error_value; 2025-05-07T19:57:41.1746277Z ^ 2025-05-07T19:57:41.1746528Z 2025-05-07T19:57:43.7073834Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:43.7098442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7101583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7102799Z ^ 2025-05-07T19:57:43.7103060Z 2025-05-07T19:57:43.7103537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.7104209Z 2025-05-07T19:57:43.7105900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7108630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7109867Z ^ 2025-05-07T19:57:43.7110236Z 2025-05-07T19:57:43.7111877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7114692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7115867Z ^ 2025-05-07T19:57:43.7116150Z 2025-05-07T19:57:43.7116593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.7117240Z 2025-05-07T19:57:43.7118792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7121541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7122745Z ^ 2025-05-07T19:57:43.7123075Z 2025-05-07T19:57:43.7124342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7127031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7128252Z ^ 2025-05-07T19:57:43.7128789Z 2025-05-07T19:57:43.7129251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.7129965Z 2025-05-07T19:57:43.7131715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7134500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7135718Z ^ 2025-05-07T19:57:43.7136114Z 2025-05-07T19:57:43.7137800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7140547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7141731Z ^ 2025-05-07T19:57:43.7142008Z 2025-05-07T19:57:43.7142778Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:43.7143584Z 2025-05-07T19:57:43.7145259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7148159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7149339Z ^ 2025-05-07T19:57:43.7149708Z 2025-05-07T19:57:43.7151320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7154107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7155133Z ^ 2025-05-07T19:57:43.7155348Z 2025-05-07T19:57:43.7155769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:43.7156458Z 2025-05-07T19:57:43.7158142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7160856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:43.7162061Z ^ 2025-05-07T19:57:43.7162469Z 2025-05-07T19:57:45.4698673Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:45.4722369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4725135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4726353Z ^ 2025-05-07T19:57:45.4726616Z 2025-05-07T19:57:45.4727087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4727802Z 2025-05-07T19:57:45.4729857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4732620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4733846Z ^ 2025-05-07T19:57:45.4734245Z 2025-05-07T19:57:45.4735918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4738620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4739825Z ^ 2025-05-07T19:57:45.4740084Z 2025-05-07T19:57:45.4740828Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:57:45.4741497Z 2025-05-07T19:57:45.4743172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4745918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4747137Z ^ 2025-05-07T19:57:45.4747510Z 2025-05-07T19:57:45.4749177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4751886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4753076Z ^ 2025-05-07T19:57:45.4753332Z 2025-05-07T19:57:45.4753783Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4754645Z 2025-05-07T19:57:45.4756331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4758913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4759978Z ^ 2025-05-07T19:57:45.4760348Z 2025-05-07T19:57:45.4762372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4765090Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4766314Z ^ 2025-05-07T19:57:45.4766584Z 2025-05-07T19:57:45.4767191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4767900Z 2025-05-07T19:57:45.4769623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4772417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4773669Z ^ 2025-05-07T19:57:45.4774033Z 2025-05-07T19:57:45.4775786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4778601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4779820Z ^ 2025-05-07T19:57:45.4780113Z 2025-05-07T19:57:45.4780595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4781445Z 2025-05-07T19:57:45.4783137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4785829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4787191Z ^ 2025-05-07T19:57:45.4787553Z 2025-05-07T19:57:48.8981163Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:48.9003478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9006036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9007131Z ^ 2025-05-07T19:57:48.9007377Z 2025-05-07T19:57:48.9007775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:48.9008413Z 2025-05-07T19:57:48.9009927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9012414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9013509Z ^ 2025-05-07T19:57:48.9013861Z 2025-05-07T19:57:48.9015477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9018290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9019360Z ^ 2025-05-07T19:57:48.9019635Z 2025-05-07T19:57:48.9020040Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:48.9020656Z 2025-05-07T19:57:48.9022167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9024605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9025672Z ^ 2025-05-07T19:57:48.9026027Z 2025-05-07T19:57:48.9027472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9030145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9031217Z ^ 2025-05-07T19:57:48.9031448Z 2025-05-07T19:57:48.9031857Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:48.9032476Z 2025-05-07T19:57:48.9034124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9037078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9038197Z ^ 2025-05-07T19:57:48.9038543Z 2025-05-07T19:57:48.9040102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9042678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9043738Z ^ 2025-05-07T19:57:48.9043979Z 2025-05-07T19:57:48.9044370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:48.9044918Z 2025-05-07T19:57:48.9046496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9049051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9050187Z ^ 2025-05-07T19:57:48.9050482Z 2025-05-07T19:57:48.9051843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9054095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9055121Z ^ 2025-05-07T19:57:48.9055330Z 2025-05-07T19:57:48.9055675Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:48.9056236Z 2025-05-07T19:57:48.9057635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:48.9060133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:48.9061140Z ^ 2025-05-07T19:57:48.9061455Z 2025-05-07T19:57:49.0540719Z 
[272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:49.0562411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0565006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0566061Z ^ 2025-05-07T19:57:49.0566336Z 2025-05-07T19:57:49.0566801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.0567412Z 2025-05-07T19:57:49.0568962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0571459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0572904Z ^ 2025-05-07T19:57:49.0573291Z 2025-05-07T19:57:49.0574713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0577233Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0578419Z ^ 2025-05-07T19:57:49.0578637Z 2025-05-07T19:57:49.0579033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.0579611Z 2025-05-07T19:57:49.0581211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0583837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0584830Z ^ 2025-05-07T19:57:49.0585151Z 2025-05-07T19:57:49.0586298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:49.0588044Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:49.0588509Z ^ 2025-05-07T19:57:49.0588765Z 2025-05-07T19:57:49.0590248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0592868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0594110Z ^ 2025-05-07T19:57:49.0594377Z 2025-05-07T19:57:49.0594790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.0595402Z 2025-05-07T19:57:49.0596893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0599636Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0600681Z ^ 2025-05-07T19:57:49.0601011Z 2025-05-07T19:57:49.0602251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:49.0603756Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:49.0604266Z ^ 2025-05-07T19:57:49.0604506Z 2025-05-07T19:57:49.0605980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0608494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0609596Z ^ 2025-05-07T19:57:49.0609850Z 2025-05-07T19:57:49.0610246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.0610822Z 2025-05-07T19:57:49.0612365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0614763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0615836Z ^ 2025-05-07T19:57:49.0616195Z 2025-05-07T19:57:49.0617391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:49.0618926Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:49.0619427Z ^ 2025-05-07T19:57:49.0619656Z 
2025-05-07T19:57:49.0621163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0623604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0624687Z ^ 2025-05-07T19:57:49.0624928Z 2025-05-07T19:57:49.0625334Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.0625985Z 2025-05-07T19:57:49.0627513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.0630308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.0631402Z ^ 2025-05-07T19:57:49.0631787Z 2025-05-07T19:57:49.0633304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:49.0635081Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:49.0635646Z ^ 2025-05-07T19:57:49.0635886Z 2025-05-07T19:57:50.4589593Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:57:50.4614207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4617129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4636276Z ^ 2025-05-07T19:57:50.4636580Z 2025-05-07T19:57:50.4637061Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4637727Z 2025-05-07T19:57:50.4639410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4641965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4643139Z ^ 2025-05-07T19:57:50.4643515Z 2025-05-07T19:57:50.4645506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4648650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4649990Z ^ 2025-05-07T19:57:50.4650264Z 2025-05-07T19:57:50.4650739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4651471Z 2025-05-07T19:57:50.4653234Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4656067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4657320Z ^ 2025-05-07T19:57:50.4657702Z 2025-05-07T19:57:50.4659606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4662316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4663524Z ^ 2025-05-07T19:57:50.4663786Z 2025-05-07T19:57:50.4664268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4664948Z 2025-05-07T19:57:50.4666563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4669498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4670742Z ^ 2025-05-07T19:57:50.4671113Z 2025-05-07T19:57:50.4672807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4675717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4676906Z ^ 
2025-05-07T19:57:50.4677185Z 2025-05-07T19:57:50.4677650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4678324Z 2025-05-07T19:57:50.4680071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4682809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4684053Z ^ 2025-05-07T19:57:50.4684423Z 2025-05-07T19:57:50.4686142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4689011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4690209Z ^ 2025-05-07T19:57:50.4690471Z 2025-05-07T19:57:50.4690948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4691855Z 2025-05-07T19:57:50.4693618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4696528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4697816Z ^ 2025-05-07T19:57:50.4698228Z 2025-05-07T19:57:50.9127031Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:50.9150995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9153545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9154664Z ^ 2025-05-07T19:57:50.9154892Z 2025-05-07T19:57:50.9155303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.9155943Z 2025-05-07T19:57:50.9157868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9160336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9161654Z ^ 2025-05-07T19:57:50.9162138Z 2025-05-07T19:57:50.9163507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9166009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9167168Z ^ 2025-05-07T19:57:50.9167429Z 2025-05-07T19:57:50.9167902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.9168575Z 2025-05-07T19:57:50.9170346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9173119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9174505Z ^ 2025-05-07T19:57:50.9174857Z 2025-05-07T19:57:50.9176377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9179103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9180180Z ^ 2025-05-07T19:57:50.9180440Z 2025-05-07T19:57:50.9180898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.9181758Z 2025-05-07T19:57:50.9183311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9185999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9187165Z ^ 2025-05-07T19:57:50.9187538Z 2025-05-07T19:57:50.9189209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9191916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9193073Z ^ 2025-05-07T19:57:50.9193345Z 2025-05-07T19:57:50.9193988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.9194597Z 2025-05-07T19:57:50.9196203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9199063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9200269Z ^ 2025-05-07T19:57:50.9200629Z 2025-05-07T19:57:50.9202376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9205132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9206160Z ^ 2025-05-07T19:57:50.9206615Z 2025-05-07T19:57:50.9207047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.9207767Z 2025-05-07T19:57:50.9209370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.9211923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.9213137Z ^ 
2025-05-07T19:57:50.9213478Z 2025-05-07T19:57:51.8582665Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:51.8603977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8606362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8607486Z ^ 2025-05-07T19:57:51.8607759Z 2025-05-07T19:57:51.8608550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.8609108Z 2025-05-07T19:57:51.8610402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8612565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8613492Z ^ 2025-05-07T19:57:51.8613820Z 2025-05-07T19:57:51.8615148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:57:51.8617506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8618581Z ^ 2025-05-07T19:57:51.8618800Z 2025-05-07T19:57:51.8619195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.8619902Z 2025-05-07T19:57:51.8621598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8624195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8625419Z ^ 2025-05-07T19:57:51.8625813Z 2025-05-07T19:57:51.8627430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8630356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8631374Z ^ 2025-05-07T19:57:51.8631627Z 2025-05-07T19:57:51.8632080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.8632681Z 2025-05-07T19:57:51.8634309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8636789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8638001Z ^ 2025-05-07T19:57:51.8638384Z 2025-05-07T19:57:51.8639894Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8642179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8643189Z ^ 2025-05-07T19:57:51.8643426Z 2025-05-07T19:57:51.8643826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.8644443Z 2025-05-07T19:57:51.8645986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8648475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8650032Z ^ 2025-05-07T19:57:51.8650346Z 2025-05-07T19:57:51.8651629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8654299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8655344Z ^ 2025-05-07T19:57:51.8655591Z 2025-05-07T19:57:51.8655968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.8656521Z 2025-05-07T19:57:51.8658001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.8660581Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.8661747Z ^ 2025-05-07T19:57:51.8662105Z 2025-05-07T19:57:53.4156361Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:57:53.4178381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4181081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4182346Z ^ 2025-05-07T19:57:53.4182754Z 2025-05-07T19:57:53.4183148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.4183744Z 2025-05-07T19:57:53.4185159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4187323Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4188357Z ^ 2025-05-07T19:57:53.4188683Z 2025-05-07T19:57:53.4190159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4192488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4193465Z ^ 2025-05-07T19:57:53.4193701Z 2025-05-07T19:57:53.4194268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.4194853Z 2025-05-07T19:57:53.4196432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4198928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4200135Z ^ 2025-05-07T19:57:53.4200476Z 2025-05-07T19:57:53.4201946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4204488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4205590Z ^ 2025-05-07T19:57:53.4205825Z 2025-05-07T19:57:53.4206245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.4206853Z 2025-05-07T19:57:53.4208393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4210989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4212038Z ^ 2025-05-07T19:57:53.4212351Z 2025-05-07T19:57:53.4213745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4216261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4217344Z ^ 2025-05-07T19:57:53.4217594Z 2025-05-07T19:57:53.4218000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.4218610Z 2025-05-07T19:57:53.4220386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4222890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4224107Z ^ 2025-05-07T19:57:53.4224430Z 2025-05-07T19:57:53.4225955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4228278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4229661Z ^ 2025-05-07T19:57:53.4229891Z 2025-05-07T19:57:53.4230286Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:57:53.4230930Z 2025-05-07T19:57:53.4232253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.4234575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.4235610Z ^ 2025-05-07T19:57:53.4235881Z 2025-05-07T19:58:00.9445336Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:00.9466536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9469323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9470333Z ^ 2025-05-07T19:58:00.9470590Z 2025-05-07T19:58:00.9470960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9471471Z 2025-05-07T19:58:00.9472846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:00.9475520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9476622Z ^ 2025-05-07T19:58:00.9476966Z 2025-05-07T19:58:00.9478510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9480772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9481941Z ^ 2025-05-07T19:58:00.9482180Z 2025-05-07T19:58:00.9482585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9483230Z 2025-05-07T19:58:00.9484663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9487309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9488406Z ^ 2025-05-07T19:58:00.9488771Z 2025-05-07T19:58:00.9490193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9492610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9493672Z ^ 2025-05-07T19:58:00.9493940Z 2025-05-07T19:58:00.9494358Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9494984Z 2025-05-07T19:58:00.9496500Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9498986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9500124Z ^ 2025-05-07T19:58:00.9500481Z 2025-05-07T19:58:00.9501990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9504438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9505520Z ^ 2025-05-07T19:58:00.9506033Z 2025-05-07T19:58:00.9506433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9507054Z 2025-05-07T19:58:00.9508536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9511082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9512156Z ^ 2025-05-07T19:58:00.9512531Z 2025-05-07T19:58:00.9514210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9516697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9517920Z ^ 
2025-05-07T19:58:00.9518154Z 2025-05-07T19:58:00.9518577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9519178Z 2025-05-07T19:58:00.9520671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9523137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9524256Z ^ 2025-05-07T19:58:00.9524614Z 2025-05-07T19:58:11.9403370Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:11.9427501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9430470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9431592Z ^ 2025-05-07T19:58:11.9431876Z 2025-05-07T19:58:11.9432362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.9433016Z 2025-05-07T19:58:11.9434784Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9437698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9438849Z ^ 2025-05-07T19:58:11.9439166Z 2025-05-07T19:58:11.9440593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9443264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9444790Z ^ 2025-05-07T19:58:11.9445161Z 2025-05-07T19:58:11.9445609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.9446310Z 2025-05-07T19:58:11.9447906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9450386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9451472Z ^ 2025-05-07T19:58:11.9451857Z 2025-05-07T19:58:11.9453437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9455870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9456968Z ^ 
2025-05-07T19:58:11.9457241Z 2025-05-07T19:58:11.9457693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.9458359Z 2025-05-07T19:58:11.9460009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9462705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9463920Z ^ 2025-05-07T19:58:11.9464300Z 2025-05-07T19:58:11.9466324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9468927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9470220Z ^ 2025-05-07T19:58:11.9470449Z 2025-05-07T19:58:11.9470900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.9471581Z 2025-05-07T19:58:11.9473241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9476109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9477308Z ^ 2025-05-07T19:58:11.9477731Z 2025-05-07T19:58:11.9479245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:11.9481738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9482875Z ^ 2025-05-07T19:58:11.9483119Z 2025-05-07T19:58:11.9483594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:11.9484243Z 2025-05-07T19:58:11.9485853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:11.9488724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:11.9490296Z ^ 2025-05-07T19:58:11.9490679Z 2025-05-07T19:58:14.4837975Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:14.4862380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4865063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:14.4866253Z ^ 2025-05-07T19:58:14.4866501Z 2025-05-07T19:58:14.4866958Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.4867659Z 2025-05-07T19:58:14.4869314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4871684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4872899Z ^ 2025-05-07T19:58:14.4873296Z 2025-05-07T19:58:14.4875114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4878177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4879268Z ^ 2025-05-07T19:58:14.4879520Z 2025-05-07T19:58:14.4879930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.4880530Z 2025-05-07T19:58:14.4881945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4884441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4885609Z ^ 2025-05-07T19:58:14.4885936Z 2025-05-07T19:58:14.4887488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:14.4890003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4891061Z ^ 2025-05-07T19:58:14.4891297Z 2025-05-07T19:58:14.4891678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.4892307Z 2025-05-07T19:58:14.4893853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4896557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4897592Z ^ 2025-05-07T19:58:14.4897926Z 2025-05-07T19:58:14.4899590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4902245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4903411Z ^ 2025-05-07T19:58:14.4903653Z 2025-05-07T19:58:14.4904079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.4904661Z 2025-05-07T19:58:14.4906166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4908659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4909763Z ^ 2025-05-07T19:58:14.4910090Z 2025-05-07T19:58:14.4911687Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4914464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4915461Z ^ 2025-05-07T19:58:14.4915696Z 2025-05-07T19:58:14.4916049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.4916848Z 2025-05-07T19:58:14.4918535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.4921232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.4922438Z ^ 2025-05-07T19:58:14.4922806Z 2025-05-07T19:58:17.7973734Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:17.7998379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:17.8000968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8002073Z ^ 2025-05-07T19:58:17.8002317Z 2025-05-07T19:58:17.8002719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:17.8003319Z 2025-05-07T19:58:17.8005086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8008207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8009464Z ^ 2025-05-07T19:58:17.8009851Z 2025-05-07T19:58:17.8011328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8013897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8015103Z ^ 2025-05-07T19:58:17.8015381Z 2025-05-07T19:58:17.8015852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:17.8016530Z 2025-05-07T19:58:17.8018427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8020684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8021708Z ^ 2025-05-07T19:58:17.8022036Z 2025-05-07T19:58:17.8023402Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8025762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8026918Z ^ 2025-05-07T19:58:17.8027154Z 2025-05-07T19:58:17.8027866Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:17.8028801Z 2025-05-07T19:58:17.8030435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8035799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8037250Z ^ 2025-05-07T19:58:17.8037655Z 2025-05-07T19:58:17.8039288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8041901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8043130Z ^ 2025-05-07T19:58:17.8043425Z 2025-05-07T19:58:17.8043868Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:17.8044529Z 2025-05-07T19:58:17.8046286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8048922Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8050064Z ^ 2025-05-07T19:58:17.8050376Z 2025-05-07T19:58:17.8052091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8054937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8056119Z ^ 2025-05-07T19:58:17.8056383Z 2025-05-07T19:58:17.8056840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:17.8057510Z 2025-05-07T19:58:17.8059147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:17.8061809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:17.8062979Z ^ 2025-05-07T19:58:17.8063369Z 2025-05-07T19:58:18.4399903Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:18.4423331Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4426071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4427312Z ^ 2025-05-07T19:58:18.4427588Z 2025-05-07T19:58:18.4428090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.4429323Z 2025-05-07T19:58:18.4431012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4433864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4435100Z ^ 2025-05-07T19:58:18.4435480Z 2025-05-07T19:58:18.4437170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4439881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4441133Z ^ 2025-05-07T19:58:18.4441384Z 2025-05-07T19:58:18.4441846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.4442529Z 2025-05-07T19:58:18.4444220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4447000Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4448227Z ^ 2025-05-07T19:58:18.4448642Z 2025-05-07T19:58:18.4450172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4453181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4454405Z ^ 2025-05-07T19:58:18.4454692Z 2025-05-07T19:58:18.4455146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.4456079Z 2025-05-07T19:58:18.4457666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4460181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4461371Z ^ 2025-05-07T19:58:18.4461740Z 2025-05-07T19:58:18.4463350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4465759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4466911Z ^ 2025-05-07T19:58:18.4467129Z 2025-05-07T19:58:18.4467507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.4468083Z 2025-05-07T19:58:18.4469459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4471986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4473019Z ^ 2025-05-07T19:58:18.4473632Z 2025-05-07T19:58:18.4475426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4478277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4479455Z ^ 2025-05-07T19:58:18.4479740Z 2025-05-07T19:58:18.4480202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.4480883Z 2025-05-07T19:58:18.4482602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.4485279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.4486446Z ^ 2025-05-07T19:58:18.4486776Z 2025-05-07T19:58:20.7164323Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:20.7187710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7190645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7191838Z ^ 2025-05-07T19:58:20.7192064Z 2025-05-07T19:58:20.7192450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7193089Z 2025-05-07T19:58:20.7194927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7197358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7198596Z ^ 2025-05-07T19:58:20.7198933Z 2025-05-07T19:58:20.7200498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7203159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7204325Z ^ 2025-05-07T19:58:20.7204586Z 2025-05-07T19:58:20.7205075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7205747Z 2025-05-07T19:58:20.7207396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7210145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7211431Z ^ 2025-05-07T19:58:20.7212065Z 2025-05-07T19:58:20.7213518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7215942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7217039Z ^ 2025-05-07T19:58:20.7217271Z 2025-05-07T19:58:20.7217684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7218295Z 2025-05-07T19:58:20.7219783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7222147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7223287Z ^ 2025-05-07T19:58:20.7223636Z 2025-05-07T19:58:20.7225256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7227584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7229051Z ^ 2025-05-07T19:58:20.7229288Z 2025-05-07T19:58:20.7229713Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:20.7230346Z 2025-05-07T19:58:20.7232013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7235060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7236271Z ^ 2025-05-07T19:58:20.7236651Z 2025-05-07T19:58:20.7238417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7241108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7242393Z ^ 2025-05-07T19:58:20.7242659Z 2025-05-07T19:58:20.7243090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7243731Z 2025-05-07T19:58:20.7245408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7248064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7249255Z ^ 2025-05-07T19:58:20.7249611Z 2025-05-07T19:58:20.7413877Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:20.7437670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7440466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7441645Z ^ 2025-05-07T19:58:20.7441932Z 2025-05-07T19:58:20.7442382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7443047Z 2025-05-07T19:58:20.7444701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7447408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7448628Z ^ 2025-05-07T19:58:20.7449000Z 2025-05-07T19:58:20.7450643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7453298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7454442Z ^ 2025-05-07T19:58:20.7454694Z 2025-05-07T19:58:20.7455150Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:20.7455841Z 2025-05-07T19:58:20.7457875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7460594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7461775Z ^ 2025-05-07T19:58:20.7462270Z 2025-05-07T19:58:20.7463942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7466361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7467441Z ^ 2025-05-07T19:58:20.7467673Z 2025-05-07T19:58:20.7468111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7468724Z 2025-05-07T19:58:20.7470405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7473020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7474363Z ^ 2025-05-07T19:58:20.7474737Z 2025-05-07T19:58:20.7476358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7479036Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7480261Z ^ 2025-05-07T19:58:20.7480556Z 2025-05-07T19:58:20.7481261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7481920Z 2025-05-07T19:58:20.7483631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7486356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7487458Z ^ 2025-05-07T19:58:20.7487814Z 2025-05-07T19:58:20.7489490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7492118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7493274Z ^ 2025-05-07T19:58:20.7493527Z 2025-05-07T19:58:20.7494007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.7494559Z 2025-05-07T19:58:20.7495920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.7498597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.7499801Z ^ 2025-05-07T19:58:20.7500191Z 2025-05-07T19:58:23.2353932Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:23.2377603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2380284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2381664Z ^ 2025-05-07T19:58:23.2381927Z 2025-05-07T19:58:23.2382419Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.2383058Z 2025-05-07T19:58:23.2384808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2387529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2388749Z ^ 2025-05-07T19:58:23.2389089Z 2025-05-07T19:58:23.2390794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2394021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:23.2395230Z ^ 2025-05-07T19:58:23.2395516Z 2025-05-07T19:58:23.2396244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.2396906Z 2025-05-07T19:58:23.2398556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2401219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2402425Z ^ 2025-05-07T19:58:23.2402797Z 2025-05-07T19:58:23.2404420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2407135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2408268Z ^ 2025-05-07T19:58:23.2408522Z 2025-05-07T19:58:23.2408961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.2409659Z 2025-05-07T19:58:23.2411362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2413942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2415092Z ^ 2025-05-07T19:58:23.2415490Z 2025-05-07T19:58:23.2417158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:23.2419968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2421433Z ^ 2025-05-07T19:58:23.2421723Z 2025-05-07T19:58:23.2422211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.2422908Z 2025-05-07T19:58:23.2424652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2427414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2428901Z ^ 2025-05-07T19:58:23.2429283Z 2025-05-07T19:58:23.2431016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2433924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2435175Z ^ 2025-05-07T19:58:23.2435443Z 2025-05-07T19:58:23.2435918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.2436636Z 2025-05-07T19:58:23.2438385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.2441318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.2442887Z ^ 2025-05-07T19:58:23.2443290Z 2025-05-07T19:58:23.7230635Z [285/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:23.7255279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7258184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7259422Z ^ 2025-05-07T19:58:23.7259695Z 2025-05-07T19:58:23.7260101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.7260777Z 2025-05-07T19:58:23.7262539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7265307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7266558Z ^ 2025-05-07T19:58:23.7266948Z 2025-05-07T19:58:23.7269141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7272002Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7273348Z ^ 2025-05-07T19:58:23.7273613Z 2025-05-07T19:58:23.7274249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.7274895Z 2025-05-07T19:58:23.7276511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7279216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7280395Z ^ 2025-05-07T19:58:23.7280797Z 2025-05-07T19:58:23.7282482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7285136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7286336Z ^ 2025-05-07T19:58:23.7286635Z 2025-05-07T19:58:23.7287093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.7287756Z 2025-05-07T19:58:23.7289410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7292098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7293451Z ^ 2025-05-07T19:58:23.7293822Z 2025-05-07T19:58:23.7295497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7298219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7299541Z ^ 2025-05-07T19:58:23.7299803Z 2025-05-07T19:58:23.7300256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.7300954Z 2025-05-07T19:58:23.7302846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7305662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7306901Z ^ 2025-05-07T19:58:23.7307313Z 2025-05-07T19:58:23.7309141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7311888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.7313055Z ^ 2025-05-07T19:58:23.7313342Z 2025-05-07T19:58:23.7313909Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.7314573Z 2025-05-07T19:58:23.7316410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.7319160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:23.7320443Z ^ 2025-05-07T19:58:23.7320810Z 2025-05-07T19:58:24.1027994Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:24.1052350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1055182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1056524Z ^ 2025-05-07T19:58:24.1056793Z 2025-05-07T19:58:24.1057286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1057980Z 2025-05-07T19:58:24.1059592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1062284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1063887Z ^ 2025-05-07T19:58:24.1064214Z 2025-05-07T19:58:24.1065831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:24.1068812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1069999Z ^ 2025-05-07T19:58:24.1070277Z 2025-05-07T19:58:24.1070732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1071408Z 2025-05-07T19:58:24.1073133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1076028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1077273Z ^ 2025-05-07T19:58:24.1077650Z 2025-05-07T19:58:24.1079045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:24.1080853Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:24.1081426Z ^ 2025-05-07T19:58:24.1081695Z 2025-05-07T19:58:24.1083351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1086079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1087480Z ^ 2025-05-07T19:58:24.1087733Z 2025-05-07T19:58:24.1088153Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1088839Z 2025-05-07T19:58:24.1090285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1092939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1094135Z ^ 2025-05-07T19:58:24.1094527Z 2025-05-07T19:58:24.1095904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:24.1097701Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:24.1098311Z ^ 2025-05-07T19:58:24.1098581Z 2025-05-07T19:58:24.1100252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1102972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1104028Z ^ 2025-05-07T19:58:24.1104284Z 2025-05-07T19:58:24.1104734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1105403Z 2025-05-07T19:58:24.1107121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1110053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1111317Z ^ 2025-05-07T19:58:24.1111680Z 2025-05-07T19:58:24.1113084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 
2025-05-07T19:58:24.1115062Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:24.1115657Z ^ 2025-05-07T19:58:24.1115922Z 2025-05-07T19:58:24.1117629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1120329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1121531Z ^ 2025-05-07T19:58:24.1121800Z 2025-05-07T19:58:24.1122261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1122982Z 2025-05-07T19:58:24.1124669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1127360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1128678Z ^ 2025-05-07T19:58:24.1129058Z 2025-05-07T19:58:24.1130209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:24.1132139Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:24.1132685Z ^ 2025-05-07T19:58:24.1132944Z 2025-05-07T19:58:24.2435273Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:24.2459401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2462235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2463483Z ^ 2025-05-07T19:58:24.2463756Z 2025-05-07T19:58:24.2464225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.2464958Z 2025-05-07T19:58:24.2466722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2469524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2470769Z ^ 2025-05-07T19:58:24.2471192Z 2025-05-07T19:58:24.2473077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2476210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2477357Z ^ 2025-05-07T19:58:24.2477656Z 2025-05-07T19:58:24.2478105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.2478762Z 2025-05-07T19:58:24.2480411Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2482767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2483879Z ^ 2025-05-07T19:58:24.2484245Z 2025-05-07T19:58:24.2485939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2488479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2489658Z ^ 2025-05-07T19:58:24.2489901Z 2025-05-07T19:58:24.2490344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.2491010Z 2025-05-07T19:58:24.2492631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2495484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2496626Z ^ 2025-05-07T19:58:24.2496988Z 2025-05-07T19:58:24.2498578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2501245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2502401Z ^ 
2025-05-07T19:58:24.2502659Z 2025-05-07T19:58:24.2503116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.2503712Z 2025-05-07T19:58:24.2505380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2508208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2509415Z ^ 2025-05-07T19:58:24.2509792Z 2025-05-07T19:58:24.2511472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2514327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2515455Z ^ 2025-05-07T19:58:24.2515668Z 2025-05-07T19:58:24.2516082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.2516745Z 2025-05-07T19:58:24.2518455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.2521318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.2522544Z ^ 2025-05-07T19:58:24.2522922Z 2025-05-07T19:58:28.4144000Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:28.4160204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4161974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4162742Z ^ 2025-05-07T19:58:28.4162914Z 2025-05-07T19:58:28.4163213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:28.4163662Z 2025-05-07T19:58:28.4164741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4166495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4167481Z ^ 2025-05-07T19:58:28.4167736Z 2025-05-07T19:58:28.4168797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4170531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4171281Z ^ 2025-05-07T19:58:28.4171463Z 2025-05-07T19:58:28.4171756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:28.4172184Z 2025-05-07T19:58:28.4173257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4174994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4175783Z ^ 2025-05-07T19:58:28.4176026Z 2025-05-07T19:58:28.4177093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4178841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4179623Z ^ 2025-05-07T19:58:28.4179794Z 2025-05-07T19:58:28.4180089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:28.4180542Z 2025-05-07T19:58:28.4181816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4183597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4184448Z ^ 2025-05-07T19:58:28.4184684Z 2025-05-07T19:58:28.4185763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4187475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4188245Z ^ 2025-05-07T19:58:28.4188416Z 2025-05-07T19:58:28.4188741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:28.4189181Z 2025-05-07T19:58:28.4190258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4192009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4192798Z ^ 2025-05-07T19:58:28.4193035Z 2025-05-07T19:58:28.4194267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4196191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4197215Z ^ 2025-05-07T19:58:28.4197436Z 2025-05-07T19:58:28.4197784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:28.4198272Z 2025-05-07T19:58:28.4199655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:28.4202429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:28.4203708Z ^ 2025-05-07T19:58:28.4204088Z 2025-05-07T19:58:30.3248739Z 
[289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:58:30.3271497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3274159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3275293Z ^ 2025-05-07T19:58:30.3275549Z 2025-05-07T19:58:30.3275984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.3276606Z 2025-05-07T19:58:30.3278172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3280941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3282069Z ^ 2025-05-07T19:58:30.3282409Z 2025-05-07T19:58:30.3283925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3286442Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3287592Z ^ 2025-05-07T19:58:30.3287830Z 2025-05-07T19:58:30.3288416Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.3289040Z 2025-05-07T19:58:30.3290690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3293307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3294503Z ^ 2025-05-07T19:58:30.3294859Z 2025-05-07T19:58:30.3296489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3299237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3300306Z ^ 2025-05-07T19:58:30.3300565Z 2025-05-07T19:58:30.3300983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.3301602Z 2025-05-07T19:58:30.3303116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3305707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3306824Z ^ 2025-05-07T19:58:30.3307171Z 2025-05-07T19:58:30.3308937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3311510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3312672Z ^ 2025-05-07T19:58:30.3312915Z 2025-05-07T19:58:30.3313376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.3314179Z 2025-05-07T19:58:30.3315695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3318394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3319568Z ^ 2025-05-07T19:58:30.3319933Z 2025-05-07T19:58:30.3321505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3324307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3325397Z ^ 2025-05-07T19:58:30.3325659Z 2025-05-07T19:58:30.3326066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.3326668Z 2025-05-07T19:58:30.3328240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.3331209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.3332523Z ^ 
2025-05-07T19:58:30.3332891Z 2025-05-07T19:58:30.6361099Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:30.6382781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6385571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6387015Z ^ 2025-05-07T19:58:30.6387309Z 2025-05-07T19:58:30.6387768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.6388443Z 2025-05-07T19:58:30.6390180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6392673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6394048Z ^ 2025-05-07T19:58:30.6394422Z 2025-05-07T19:58:30.6396063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:58:30.6398941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6400175Z ^ 2025-05-07T19:58:30.6400446Z 2025-05-07T19:58:30.6400916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.6401658Z 2025-05-07T19:58:30.6403517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6406209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6407472Z ^ 2025-05-07T19:58:30.6407945Z 2025-05-07T19:58:30.6409827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6412660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6413770Z ^ 2025-05-07T19:58:30.6414025Z 2025-05-07T19:58:30.6414478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.6415134Z 2025-05-07T19:58:30.6416822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6419577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6420838Z ^ 2025-05-07T19:58:30.6421214Z 2025-05-07T19:58:30.6422880Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6425620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6426849Z ^ 2025-05-07T19:58:30.6427109Z 2025-05-07T19:58:30.6427836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.6428735Z 2025-05-07T19:58:30.6430428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6433167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6434634Z ^ 2025-05-07T19:58:30.6434993Z 2025-05-07T19:58:30.6436594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6439377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6440576Z ^ 2025-05-07T19:58:30.6440862Z 2025-05-07T19:58:30.6441322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.6442000Z 2025-05-07T19:58:30.6443751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.6446502Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.6447756Z ^ 2025-05-07T19:58:30.6448134Z 2025-05-07T19:58:33.2086119Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:33.2107400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2110300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2111459Z ^ 2025-05-07T19:58:33.2111708Z 2025-05-07T19:58:33.2112123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.2112770Z 2025-05-07T19:58:33.2114376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2116908Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2118058Z ^ 2025-05-07T19:58:33.2118436Z 2025-05-07T19:58:33.2119968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2122359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2123524Z ^ 2025-05-07T19:58:33.2123769Z 2025-05-07T19:58:33.2124202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.2124802Z 2025-05-07T19:58:33.2126426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2129511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2130707Z ^ 2025-05-07T19:58:33.2131107Z 2025-05-07T19:58:33.2132591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2135243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2136317Z ^ 2025-05-07T19:58:33.2136562Z 2025-05-07T19:58:33.2136965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.2137586Z 2025-05-07T19:58:33.2138831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2141183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2142149Z ^ 2025-05-07T19:58:33.2142535Z 2025-05-07T19:58:33.2143917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2145946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2146787Z ^ 2025-05-07T19:58:33.2147014Z 2025-05-07T19:58:33.2147453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.2148244Z 2025-05-07T19:58:33.2149651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2152035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2153009Z ^ 2025-05-07T19:58:33.2153297Z 2025-05-07T19:58:33.2154979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2157483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2158455Z ^ 2025-05-07T19:58:33.2158724Z 2025-05-07T19:58:33.2159140Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:33.2159726Z 2025-05-07T19:58:33.2161255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.2163801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.2164813Z ^ 2025-05-07T19:58:33.2165130Z 2025-05-07T19:58:34.6605436Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:34.6627528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6630319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6631457Z ^ 2025-05-07T19:58:34.6631706Z 2025-05-07T19:58:34.6632128Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.6632779Z 2025-05-07T19:58:34.6634445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:34.6637015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6638165Z ^ 2025-05-07T19:58:34.6638490Z 2025-05-07T19:58:34.6639921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6642360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6643499Z ^ 2025-05-07T19:58:34.6643752Z 2025-05-07T19:58:34.6644220Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.6645145Z 2025-05-07T19:58:34.6646747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6649388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6650545Z ^ 2025-05-07T19:58:34.6650904Z 2025-05-07T19:58:34.6652494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6654989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6656096Z ^ 2025-05-07T19:58:34.6656344Z 2025-05-07T19:58:34.6656702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.6657313Z 2025-05-07T19:58:34.6658927Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6661457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6662627Z ^ 2025-05-07T19:58:34.6662981Z 2025-05-07T19:58:34.6664577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6667142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6668447Z ^ 2025-05-07T19:58:34.6668698Z 2025-05-07T19:58:34.6669154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.6669768Z 2025-05-07T19:58:34.6671301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6673707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6674963Z ^ 2025-05-07T19:58:34.6675337Z 2025-05-07T19:58:34.6676944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6679545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6680674Z ^ 
2025-05-07T19:58:34.6680950Z 2025-05-07T19:58:34.6681391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:34.6682041Z 2025-05-07T19:58:34.6683692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.6686265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:34.6687412Z ^ 2025-05-07T19:58:34.6687770Z 2025-05-07T19:58:35.0245366Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:35.0265830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0268112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0269095Z ^ 2025-05-07T19:58:35.0269285Z 2025-05-07T19:58:35.0269629Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.0270081Z 2025-05-07T19:58:35.0271186Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0273296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0274292Z ^ 2025-05-07T19:58:35.0274547Z 2025-05-07T19:58:35.0275636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0277722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0278579Z ^ 2025-05-07T19:58:35.0278765Z 2025-05-07T19:58:35.0279084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.0279652Z 2025-05-07T19:58:35.0280754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0282567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0283381Z ^ 2025-05-07T19:58:35.0283632Z 2025-05-07T19:58:35.0284770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0286552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0287376Z ^ 
2025-05-07T19:58:35.0287561Z 2025-05-07T19:58:35.0287895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.0288344Z 2025-05-07T19:58:35.0289524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0291823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0293059Z ^ 2025-05-07T19:58:35.0293608Z 2025-05-07T19:58:35.0295355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0298057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0299277Z ^ 2025-05-07T19:58:35.0299582Z 2025-05-07T19:58:35.0300030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.0300654Z 2025-05-07T19:58:35.0302325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0305025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0323371Z ^ 2025-05-07T19:58:35.0323931Z 2025-05-07T19:58:35.0325582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:35.0328617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0329873Z ^ 2025-05-07T19:58:35.0330144Z 2025-05-07T19:58:35.0330575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.0331220Z 2025-05-07T19:58:35.0333195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0335912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.0337093Z ^ 2025-05-07T19:58:35.0337627Z 2025-05-07T19:58:36.3225355Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:36.3249280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3252072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3253248Z ^ 2025-05-07T19:58:36.3253509Z 
2025-05-07T19:58:36.3253958Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.3254603Z 2025-05-07T19:58:36.3256294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3258953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3260153Z ^ 2025-05-07T19:58:36.3260851Z 2025-05-07T19:58:36.3262477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3266475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3267644Z ^ 2025-05-07T19:58:36.3267897Z 2025-05-07T19:58:36.3268362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.3269003Z 2025-05-07T19:58:36.3270651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3273287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3274543Z ^ 2025-05-07T19:58:36.3274937Z 2025-05-07T19:58:36.3276560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3279209Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3280365Z ^ 2025-05-07T19:58:36.3280641Z 2025-05-07T19:58:36.3281076Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.3281720Z 2025-05-07T19:58:36.3283411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3286175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3287372Z ^ 2025-05-07T19:58:36.3287707Z 2025-05-07T19:58:36.3289362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3291936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3293077Z ^ 2025-05-07T19:58:36.3293325Z 2025-05-07T19:58:36.3293768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.3294432Z 2025-05-07T19:58:36.3296076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3298695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3299856Z ^ 2025-05-07T19:58:36.3300244Z 2025-05-07T19:58:36.3301848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3304462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3305617Z ^ 2025-05-07T19:58:36.3305895Z 2025-05-07T19:58:36.3306527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.3307162Z 2025-05-07T19:58:36.3308797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.3311487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.3312673Z ^ 2025-05-07T19:58:36.3313036Z 2025-05-07T19:58:36.8485353Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:36.8509180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8511941Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8513115Z ^ 2025-05-07T19:58:36.8513369Z 2025-05-07T19:58:36.8513880Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.8514517Z 2025-05-07T19:58:36.8516579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8519249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8520671Z ^ 2025-05-07T19:58:36.8521231Z 2025-05-07T19:58:36.8522909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8525647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8526891Z ^ 2025-05-07T19:58:36.8527157Z 2025-05-07T19:58:36.8527651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.8528786Z 2025-05-07T19:58:36.8530655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8533391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8534619Z ^ 2025-05-07T19:58:36.8534978Z 2025-05-07T19:58:36.8536487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8539157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8540318Z ^ 2025-05-07T19:58:36.8540603Z 2025-05-07T19:58:36.8541070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.8541932Z 2025-05-07T19:58:36.8543916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8546550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8547764Z ^ 2025-05-07T19:58:36.8548140Z 2025-05-07T19:58:36.8549954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8552594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8554091Z ^ 2025-05-07T19:58:36.8554345Z 2025-05-07T19:58:36.8554784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.8555448Z 2025-05-07T19:58:36.8557124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8559859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:36.8561072Z ^ 2025-05-07T19:58:36.8561467Z 2025-05-07T19:58:36.8563392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8566145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8567127Z ^ 2025-05-07T19:58:36.8567398Z 2025-05-07T19:58:36.8567827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.8568581Z 2025-05-07T19:58:36.8570218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.8573208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.8574467Z ^ 2025-05-07T19:58:36.8574853Z 2025-05-07T19:58:43.6136506Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:58:43.6159763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:43.6162486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6163637Z ^ 2025-05-07T19:58:43.6163916Z 2025-05-07T19:58:43.6164752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.6165480Z 2025-05-07T19:58:43.6167100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6169848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6171139Z ^ 2025-05-07T19:58:43.6171560Z 2025-05-07T19:58:43.6173225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6175986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6177100Z ^ 2025-05-07T19:58:43.6177395Z 2025-05-07T19:58:43.6177853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.6178472Z 2025-05-07T19:58:43.6180043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6182533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6183669Z ^ 2025-05-07T19:58:43.6184029Z 2025-05-07T19:58:43.6185745Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6188546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6189822Z ^ 2025-05-07T19:58:43.6190094Z 2025-05-07T19:58:43.6190555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.6191256Z 2025-05-07T19:58:43.6192908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6195754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6196944Z ^ 2025-05-07T19:58:43.6197333Z 2025-05-07T19:58:43.6198995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6201678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6202870Z ^ 2025-05-07T19:58:43.6203159Z 2025-05-07T19:58:43.6203598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.6204265Z 2025-05-07T19:58:43.6205901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6208561Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6209982Z ^ 2025-05-07T19:58:43.6210400Z 2025-05-07T19:58:43.6211925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6214489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6215575Z ^ 2025-05-07T19:58:43.6215858Z 2025-05-07T19:58:43.6216323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.6216945Z 2025-05-07T19:58:43.6218544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.6221344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.6222521Z ^ 2025-05-07T19:58:43.6222891Z 2025-05-07T19:58:45.5844704Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:45.5869667Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5872219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5873361Z ^ 2025-05-07T19:58:45.5873916Z 2025-05-07T19:58:45.5874370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.5874913Z 2025-05-07T19:58:45.5876513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5879257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5880798Z ^ 2025-05-07T19:58:45.5881186Z 2025-05-07T19:58:45.5882926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5885656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5886853Z ^ 2025-05-07T19:58:45.5887124Z 2025-05-07T19:58:45.5887587Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.5888294Z 2025-05-07T19:58:45.5890004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5892549Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5893839Z ^ 2025-05-07T19:58:45.5894216Z 2025-05-07T19:58:45.5895980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5898563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5899709Z ^ 2025-05-07T19:58:45.5899963Z 2025-05-07T19:58:45.5900417Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.5900967Z 2025-05-07T19:58:45.5902672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5905230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5906375Z ^ 2025-05-07T19:58:45.5906703Z 2025-05-07T19:58:45.5908297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5910915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5912207Z ^ 2025-05-07T19:58:45.5912477Z 2025-05-07T19:58:45.5912936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.5913791Z 2025-05-07T19:58:45.5915857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5918312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5919353Z ^ 2025-05-07T19:58:45.5919688Z 2025-05-07T19:58:45.5921589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5923791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5924851Z ^ 2025-05-07T19:58:45.5925090Z 2025-05-07T19:58:45.5925495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.5926103Z 2025-05-07T19:58:45.5927667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.5930320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.5931508Z ^ 2025-05-07T19:58:45.5931843Z 2025-05-07T19:58:47.2524950Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:47.2549526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2552581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2553987Z ^ 2025-05-07T19:58:47.2554246Z 2025-05-07T19:58:47.2554706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2555402Z 2025-05-07T19:58:47.2557132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2560004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2561269Z ^ 2025-05-07T19:58:47.2561689Z 2025-05-07T19:58:47.2563398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2566448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2567902Z ^ 2025-05-07T19:58:47.2568208Z 2025-05-07T19:58:47.2568607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2569250Z 2025-05-07T19:58:47.2571032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2573995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2575257Z ^ 2025-05-07T19:58:47.2575638Z 2025-05-07T19:58:47.2577347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2580216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2581302Z ^ 2025-05-07T19:58:47.2581569Z 2025-05-07T19:58:47.2582022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2582807Z 2025-05-07T19:58:47.2584634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2587291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2588642Z ^ 2025-05-07T19:58:47.2589035Z 2025-05-07T19:58:47.2590637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2593457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2595177Z ^ 2025-05-07T19:58:47.2595490Z 2025-05-07T19:58:47.2595932Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:47.2596640Z 2025-05-07T19:58:47.2598408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2601269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2602556Z ^ 2025-05-07T19:58:47.2602870Z 2025-05-07T19:58:47.2604371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2607089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2608292Z ^ 2025-05-07T19:58:47.2608561Z 2025-05-07T19:58:47.2608998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2609700Z 2025-05-07T19:58:47.2611348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2614209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2615254Z ^ 2025-05-07T19:58:47.2615645Z 2025-05-07T19:58:50.2793594Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:50.2817092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2819423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2820489Z ^ 2025-05-07T19:58:50.2820752Z 2025-05-07T19:58:50.2821254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2821885Z 2025-05-07T19:58:50.2823398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2825945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2827058Z ^ 2025-05-07T19:58:50.2827399Z 2025-05-07T19:58:50.2829209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2831093Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2831978Z ^ 2025-05-07T19:58:50.2832299Z 2025-05-07T19:58:50.2834022Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2836006Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2836569Z ^ 2025-05-07T19:58:50.2836894Z 2025-05-07T19:58:50.2838445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2840474Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2841059Z ^ 2025-05-07T19:58:50.2841368Z 2025-05-07T19:58:50.2843034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2845451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2846471Z ^ 2025-05-07T19:58:50.2846707Z 2025-05-07T19:58:50.2847167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2847811Z 2025-05-07T19:58:50.2849352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2852023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2853577Z ^ 2025-05-07T19:58:50.2853980Z 2025-05-07T19:58:50.2855535Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2857630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2858183Z ^ 2025-05-07T19:58:50.2858504Z 2025-05-07T19:58:50.2859952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2861894Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2862452Z ^ 2025-05-07T19:58:50.2862761Z 2025-05-07T19:58:50.2864335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2866307Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2866887Z ^ 2025-05-07T19:58:50.2867186Z 2025-05-07T19:58:50.2868768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2871367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2872550Z ^ 2025-05-07T19:58:50.2872813Z 2025-05-07T19:58:50.2873290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2874108Z 2025-05-07T19:58:50.2875754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:50.2878456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2879500Z ^ 2025-05-07T19:58:50.2879862Z 2025-05-07T19:58:50.2881391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2883207Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2883726Z ^ 2025-05-07T19:58:50.2884046Z 2025-05-07T19:58:50.2885677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2887724Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2888289Z ^ 2025-05-07T19:58:50.2888583Z 2025-05-07T19:58:50.2890123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2892107Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2892687Z ^ 2025-05-07T19:58:50.2892982Z 2025-05-07T19:58:50.2894638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2897471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2898585Z ^ 2025-05-07T19:58:50.2898834Z 2025-05-07T19:58:50.2899240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2899916Z 
2025-05-07T19:58:50.2901504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2903829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2904839Z ^ 2025-05-07T19:58:50.2905211Z 2025-05-07T19:58:50.2906672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2908550Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2909072Z ^ 2025-05-07T19:58:50.2909379Z 2025-05-07T19:58:50.2910819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2912659Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2913189Z ^ 2025-05-07T19:58:50.2913470Z 2025-05-07T19:58:50.2915104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2916996Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2917717Z ^ 2025-05-07T19:58:50.2918019Z 2025-05-07T19:58:50.2919546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2921893Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2922955Z ^ 2025-05-07T19:58:50.2923209Z 2025-05-07T19:58:50.2923655Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:50.2924367Z 2025-05-07T19:58:50.2925935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.2928774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:50.2929924Z ^ 2025-05-07T19:58:50.2930318Z 2025-05-07T19:58:50.2931872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2933879Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2934428Z ^ 2025-05-07T19:58:50.2934730Z 2025-05-07T19:58:50.2936304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2938221Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2939152Z ^ 2025-05-07T19:58:50.2939461Z 2025-05-07T19:58:50.2941023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:50.2943096Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:50.2943655Z ^ 2025-05-07T19:58:50.2943950Z 2025-05-07T19:58:51.9754980Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:51.9778949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9781747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9782932Z ^ 2025-05-07T19:58:51.9783193Z 2025-05-07T19:58:51.9783666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.9784333Z 2025-05-07T19:58:51.9786017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9789081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9790249Z ^ 2025-05-07T19:58:51.9790590Z 2025-05-07T19:58:51.9792307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9795358Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9796550Z ^ 2025-05-07T19:58:51.9796783Z 2025-05-07T19:58:51.9797229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.9797940Z 2025-05-07T19:58:51.9799665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9802359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9803539Z ^ 2025-05-07T19:58:51.9803913Z 2025-05-07T19:58:51.9805537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9808310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9809522Z ^ 2025-05-07T19:58:51.9809782Z 2025-05-07T19:58:51.9810267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.9811092Z 2025-05-07T19:58:51.9812807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9815611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9816856Z ^ 2025-05-07T19:58:51.9817209Z 2025-05-07T19:58:51.9818895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9821461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9822620Z ^ 2025-05-07T19:58:51.9822882Z 2025-05-07T19:58:51.9823316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.9823959Z 2025-05-07T19:58:51.9825690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9828790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9830022Z ^ 2025-05-07T19:58:51.9830403Z 2025-05-07T19:58:51.9832131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9835097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:51.9836351Z ^ 2025-05-07T19:58:51.9836608Z 2025-05-07T19:58:51.9837090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.9837896Z 2025-05-07T19:58:51.9839623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:51.9842421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:51.9843659Z ^ 2025-05-07T19:58:51.9844038Z 2025-05-07T19:58:55.2440993Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:55.2461734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2464170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2465285Z ^ 2025-05-07T19:58:55.2465527Z 2025-05-07T19:58:55.2465949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2466955Z 2025-05-07T19:58:55.2468520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2470966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2471994Z ^ 2025-05-07T19:58:55.2472308Z 2025-05-07T19:58:55.2473885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:55.2476124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2477143Z ^ 2025-05-07T19:58:55.2477411Z 2025-05-07T19:58:55.2477794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2478352Z 2025-05-07T19:58:55.2479752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2482080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2482960Z ^ 2025-05-07T19:58:55.2483294Z 2025-05-07T19:58:55.2484902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2487280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2488576Z ^ 2025-05-07T19:58:55.2488835Z 2025-05-07T19:58:55.2489284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2489880Z 2025-05-07T19:58:55.2491325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2493641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2494804Z ^ 2025-05-07T19:58:55.2495158Z 2025-05-07T19:58:55.2496741Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2499105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2500241Z ^ 2025-05-07T19:58:55.2500514Z 2025-05-07T19:58:55.2500982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2501590Z 2025-05-07T19:58:55.2503033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2505374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2506466Z ^ 2025-05-07T19:58:55.2506862Z 2025-05-07T19:58:55.2508683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2511156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2512329Z ^ 2025-05-07T19:58:55.2512605Z 2025-05-07T19:58:55.2513030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:55.2513836Z 2025-05-07T19:58:55.2515405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:55.2517700Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:55.2518836Z ^ 2025-05-07T19:58:55.2519172Z 2025-05-07T19:58:56.6465664Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:56.6486900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6489747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6490912Z ^ 2025-05-07T19:58:56.6491141Z 2025-05-07T19:58:56.6491528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.6492097Z 2025-05-07T19:58:56.6493853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6496402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6497511Z ^ 2025-05-07T19:58:56.6497836Z 2025-05-07T19:58:56.6499050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6501084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6502021Z ^ 2025-05-07T19:58:56.6502287Z 2025-05-07T19:58:56.6502715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.6503171Z 2025-05-07T19:58:56.6504448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6507060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6508099Z ^ 2025-05-07T19:58:56.6508407Z 2025-05-07T19:58:56.6509827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6512436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6513486Z ^ 2025-05-07T19:58:56.6513904Z 2025-05-07T19:58:56.6514291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.6514765Z 2025-05-07T19:58:56.6516025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6518415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6519584Z ^ 2025-05-07T19:58:56.6519922Z 2025-05-07T19:58:56.6521369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6523633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6524740Z ^ 2025-05-07T19:58:56.6524991Z 2025-05-07T19:58:56.6525424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.6526075Z 2025-05-07T19:58:56.6527514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6530290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6531197Z ^ 2025-05-07T19:58:56.6531486Z 2025-05-07T19:58:56.6532940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6535529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6536559Z ^ 2025-05-07T19:58:56.6536779Z 2025-05-07T19:58:56.6537104Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.6537675Z 2025-05-07T19:58:56.6539011Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.6541338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.6542374Z ^ 2025-05-07T19:58:56.6542701Z 2025-05-07T19:58:56.8369333Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:56.8390095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8392682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8394171Z ^ 2025-05-07T19:58:56.8394420Z 2025-05-07T19:58:56.8394855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.8395455Z 2025-05-07T19:58:56.8396903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8399409Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8400456Z ^ 2025-05-07T19:58:56.8400773Z 2025-05-07T19:58:56.8402213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8404624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8405664Z ^ 2025-05-07T19:58:56.8405922Z 2025-05-07T19:58:56.8406319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.8406921Z 2025-05-07T19:58:56.8408338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8410754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8411637Z ^ 2025-05-07T19:58:56.8411948Z 2025-05-07T19:58:56.8413378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8415807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8416833Z ^ 2025-05-07T19:58:56.8417078Z 2025-05-07T19:58:56.8417548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.8418198Z 2025-05-07T19:58:56.8419819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8422141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8423190Z ^ 2025-05-07T19:58:56.8423550Z 2025-05-07T19:58:56.8425004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8427450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8428776Z ^ 2025-05-07T19:58:56.8429032Z 2025-05-07T19:58:56.8429456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.8430403Z 2025-05-07T19:58:56.8431966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8434533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8435609Z ^ 2025-05-07T19:58:56.8435958Z 2025-05-07T19:58:56.8437367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8439784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8440874Z ^ 2025-05-07T19:58:56.8441118Z 2025-05-07T19:58:56.8441532Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:56.8442140Z 2025-05-07T19:58:56.8443621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.8445970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.8446990Z ^ 2025-05-07T19:58:56.8447325Z 2025-05-07T19:59:03.6847340Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:03.6859483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6860912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6861541Z ^ 2025-05-07T19:59:03.6861717Z 2025-05-07T19:59:03.6861970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.6862333Z 2025-05-07T19:59:03.6863229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6864637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6865302Z ^ 2025-05-07T19:59:03.6865511Z 2025-05-07T19:59:03.6866433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6867814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6868465Z ^ 2025-05-07T19:59:03.6868614Z 2025-05-07T19:59:03.6868897Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.6869311Z 2025-05-07T19:59:03.6870173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6871607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6872268Z ^ 2025-05-07T19:59:03.6872474Z 2025-05-07T19:59:03.6873326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6874889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6875531Z ^ 2025-05-07T19:59:03.6875699Z 2025-05-07T19:59:03.6875942Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:03.6876303Z 2025-05-07T19:59:03.6877197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6878573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6879231Z ^ 2025-05-07T19:59:03.6879440Z 2025-05-07T19:59:03.6880321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6881786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6882427Z ^ 2025-05-07T19:59:03.6882575Z 2025-05-07T19:59:03.6882824Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.6883234Z 2025-05-07T19:59:03.6884097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6885494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6886123Z ^ 2025-05-07T19:59:03.6886347Z 2025-05-07T19:59:03.6887199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6888600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6889224Z ^ 2025-05-07T19:59:03.6889389Z 2025-05-07T19:59:03.6889635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.6889990Z 2025-05-07T19:59:03.6890873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.6892255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.6892908Z ^ 2025-05-07T19:59:03.6893247Z 2025-05-07T19:59:03.7291142Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:03.7312646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7315399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7316512Z ^ 2025-05-07T19:59:03.7316769Z 2025-05-07T19:59:03.7317214Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:59:03.7317829Z 2025-05-07T19:59:03.7319325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7321862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7323014Z ^ 2025-05-07T19:59:03.7323323Z 2025-05-07T19:59:03.7324851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7327632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7329107Z ^ 2025-05-07T19:59:03.7329393Z 2025-05-07T19:59:03.7329809Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7330383Z 2025-05-07T19:59:03.7331927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7334444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7335588Z ^ 2025-05-07T19:59:03.7335924Z 2025-05-07T19:59:03.7337450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7339854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7340954Z ^ 2025-05-07T19:59:03.7341208Z 2025-05-07T19:59:03.7341667Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7342253Z 2025-05-07T19:59:03.7343522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7345877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7349184Z ^ 2025-05-07T19:59:03.7349690Z 2025-05-07T19:59:03.7351167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7353787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7354978Z ^ 2025-05-07T19:59:03.7355246Z 2025-05-07T19:59:03.7355650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7356257Z 2025-05-07T19:59:03.7357805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7360259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7361369Z ^ 2025-05-07T19:59:03.7361692Z 2025-05-07T19:59:03.7363119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7365446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7366602Z ^ 2025-05-07T19:59:03.7366842Z 2025-05-07T19:59:03.7367259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7367906Z 2025-05-07T19:59:03.7369480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7372232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7373336Z ^ 2025-05-07T19:59:03.7373736Z 2025-05-07T19:59:06.1747189Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:06.1767635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1769969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1771077Z ^ 2025-05-07T19:59:06.1771318Z 2025-05-07T19:59:06.1771690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.1772295Z 2025-05-07T19:59:06.1773882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1776440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1777573Z ^ 2025-05-07T19:59:06.1778155Z 2025-05-07T19:59:06.1779636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1781987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1783043Z ^ 2025-05-07T19:59:06.1783275Z 2025-05-07T19:59:06.1783727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.1784319Z 2025-05-07T19:59:06.1785842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1788385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1789513Z ^ 2025-05-07T19:59:06.1789866Z 2025-05-07T19:59:06.1791402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1793897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1795173Z ^ 2025-05-07T19:59:06.1795414Z 2025-05-07T19:59:06.1795810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.1796445Z 2025-05-07T19:59:06.1798260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1800782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1801851Z ^ 2025-05-07T19:59:06.1802328Z 2025-05-07T19:59:06.1803849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1806322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1807368Z ^ 2025-05-07T19:59:06.1807586Z 2025-05-07T19:59:06.1807997Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.1808587Z 2025-05-07T19:59:06.1810006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1812485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1813577Z ^ 2025-05-07T19:59:06.1813932Z 2025-05-07T19:59:06.1815481Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1817812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1818826Z ^ 2025-05-07T19:59:06.1819115Z 2025-05-07T19:59:06.1819709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.1820306Z 2025-05-07T19:59:06.1821769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.1824131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.1825270Z ^ 2025-05-07T19:59:06.1825605Z 2025-05-07T19:59:06.3852459Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:06.3873929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3876709Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3877794Z ^ 2025-05-07T19:59:06.3878048Z 2025-05-07T19:59:06.3878469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.3879064Z 2025-05-07T19:59:06.3880587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3883411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3884565Z ^ 2025-05-07T19:59:06.3884935Z 2025-05-07T19:59:06.3886466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3888729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3889803Z ^ 2025-05-07T19:59:06.3890037Z 2025-05-07T19:59:06.3890442Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.3891095Z 2025-05-07T19:59:06.3892655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3895163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3896274Z ^ 2025-05-07T19:59:06.3896611Z 2025-05-07T19:59:06.3898108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3900552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3901693Z ^ 2025-05-07T19:59:06.3901966Z 2025-05-07T19:59:06.3902659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.3903316Z 2025-05-07T19:59:06.3904773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3907291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3908399Z ^ 2025-05-07T19:59:06.3908752Z 2025-05-07T19:59:06.3910243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3912765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3914100Z ^ 2025-05-07T19:59:06.3914386Z 2025-05-07T19:59:06.3914804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.3915425Z 2025-05-07T19:59:06.3916981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3919421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:59:06.3920477Z ^ 2025-05-07T19:59:06.3920752Z 2025-05-07T19:59:06.3922063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3924536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3925375Z ^ 2025-05-07T19:59:06.3925593Z 2025-05-07T19:59:06.3925998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:06.3926627Z 2025-05-07T19:59:06.3928095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.3930827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:06.3931715Z ^ 2025-05-07T19:59:06.3932044Z 2025-05-07T19:59:09.3619150Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:09.3642601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3645427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3646632Z ^ 2025-05-07T19:59:09.3647245Z 2025-05-07T19:59:09.3647729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.3648403Z 2025-05-07T19:59:09.3650078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3652796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3654007Z ^ 2025-05-07T19:59:09.3654374Z 2025-05-07T19:59:09.3656000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3658639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3659817Z ^ 2025-05-07T19:59:09.3660100Z 2025-05-07T19:59:09.3660549Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.3661234Z 2025-05-07T19:59:09.3662948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3665358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3666326Z ^ 
2025-05-07T19:59:09.3666687Z 2025-05-07T19:59:09.3668392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3670720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3671802Z ^ 2025-05-07T19:59:09.3672184Z 2025-05-07T19:59:09.3672618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.3673312Z 2025-05-07T19:59:09.3674962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3677301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3678199Z ^ 2025-05-07T19:59:09.3678545Z 2025-05-07T19:59:09.3680057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3682610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3683735Z ^ 2025-05-07T19:59:09.3683996Z 2025-05-07T19:59:09.3684433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.3685072Z 2025-05-07T19:59:09.3686685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:09.3689089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3690399Z ^ 2025-05-07T19:59:09.3690750Z 2025-05-07T19:59:09.3692176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3694879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3696083Z ^ 2025-05-07T19:59:09.3696338Z 2025-05-07T19:59:09.3696757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.3697440Z 2025-05-07T19:59:09.3699051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3701569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.3702627Z ^ 2025-05-07T19:59:09.3703006Z 2025-05-07T19:59:09.5002112Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 
2025-05-07T19:59:09.5023639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5026495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5027464Z ^ 2025-05-07T19:59:09.5027697Z 2025-05-07T19:59:09.5028066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5028950Z 2025-05-07T19:59:09.5030431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5032685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5033693Z ^ 2025-05-07T19:59:09.5034235Z 2025-05-07T19:59:09.5036069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5039013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5040262Z ^ 2025-05-07T19:59:09.5040527Z 2025-05-07T19:59:09.5040936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5041509Z 2025-05-07T19:59:09.5043056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5046379Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5047620Z ^ 2025-05-07T19:59:09.5047990Z 2025-05-07T19:59:09.5049228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5051637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5052614Z ^ 2025-05-07T19:59:09.5052847Z 2025-05-07T19:59:09.5053222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5053855Z 2025-05-07T19:59:09.5055499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5057781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5058923Z ^ 2025-05-07T19:59:09.5059271Z 2025-05-07T19:59:09.5061022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5063434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5064552Z ^ 2025-05-07T19:59:09.5064778Z 2025-05-07T19:59:09.5065201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5065920Z 2025-05-07T19:59:09.5067632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5070578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5071834Z ^ 2025-05-07T19:59:09.5072247Z 2025-05-07T19:59:09.5073893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5076697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5077624Z ^ 2025-05-07T19:59:09.5077879Z 2025-05-07T19:59:09.5078304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5078912Z 2025-05-07T19:59:09.5080519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5082981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5084139Z ^ 2025-05-07T19:59:09.5084494Z 2025-05-07T19:59:09.5858642Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:09.5880968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5883762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5884985Z ^ 2025-05-07T19:59:09.5885243Z 2025-05-07T19:59:09.5885724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5886403Z 2025-05-07T19:59:09.5888113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5890987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5892098Z ^ 2025-05-07T19:59:09.5892584Z 2025-05-07T19:59:09.5894057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5896497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5897641Z ^ 2025-05-07T19:59:09.5897881Z 2025-05-07T19:59:09.5898271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5898851Z 2025-05-07T19:59:09.5900785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5903391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5904682Z ^ 2025-05-07T19:59:09.5905021Z 2025-05-07T19:59:09.5906675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5909234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5910558Z ^ 2025-05-07T19:59:09.5910806Z 2025-05-07T19:59:09.5911272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5911934Z 2025-05-07T19:59:09.5913526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5916182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5917280Z ^ 2025-05-07T19:59:09.5917667Z 2025-05-07T19:59:09.5919228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5921855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5923154Z ^ 2025-05-07T19:59:09.5923432Z 2025-05-07T19:59:09.5923872Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:09.5924481Z 2025-05-07T19:59:09.5925977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5928834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5929980Z ^ 2025-05-07T19:59:09.5930352Z 2025-05-07T19:59:09.5931990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5934744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5935902Z ^ 2025-05-07T19:59:09.5936164Z 2025-05-07T19:59:09.5936633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5937305Z 2025-05-07T19:59:09.5938997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5941534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5942513Z ^ 2025-05-07T19:59:09.5942883Z 2025-05-07T19:59:12.4263714Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:12.4286452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4288636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4289542Z ^ 2025-05-07T19:59:12.4289749Z 2025-05-07T19:59:12.4290127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.4290673Z 2025-05-07T19:59:12.4291888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4293922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4294820Z ^ 2025-05-07T19:59:12.4295118Z 2025-05-07T19:59:12.4296336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4298676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4299585Z ^ 2025-05-07T19:59:12.4299808Z 2025-05-07T19:59:12.4300159Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:59:12.4300669Z 2025-05-07T19:59:12.4308803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4311092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4312017Z ^ 2025-05-07T19:59:12.4312295Z 2025-05-07T19:59:12.4313540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4315714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4316584Z ^ 2025-05-07T19:59:12.4316808Z 2025-05-07T19:59:12.4317143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.4317648Z 2025-05-07T19:59:12.4318962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4320968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4321882Z ^ 2025-05-07T19:59:12.4322160Z 2025-05-07T19:59:12.4323406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4325716Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4326734Z ^ 2025-05-07T19:59:12.4326949Z 2025-05-07T19:59:12.4327339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.4327923Z 2025-05-07T19:59:12.4329513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4331671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4332691Z ^ 2025-05-07T19:59:12.4332996Z 2025-05-07T19:59:12.4334206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4336178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4337099Z ^ 2025-05-07T19:59:12.4337298Z 2025-05-07T19:59:12.4337675Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.4338182Z 2025-05-07T19:59:12.4339421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.4341650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.4342606Z ^ 2025-05-07T19:59:12.4342902Z 2025-05-07T19:59:15.5104828Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:15.5128860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5131818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5133021Z ^ 2025-05-07T19:59:15.5133311Z 2025-05-07T19:59:15.5133781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.5134467Z 2025-05-07T19:59:15.5136212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5138626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5139786Z ^ 2025-05-07T19:59:15.5140156Z 2025-05-07T19:59:15.5142103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5144873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:15.5146195Z ^ 2025-05-07T19:59:15.5146453Z 2025-05-07T19:59:15.5147034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.5147738Z 2025-05-07T19:59:15.5149534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5152324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5153582Z ^ 2025-05-07T19:59:15.5153924Z 2025-05-07T19:59:15.5155732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5158321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5159423Z ^ 2025-05-07T19:59:15.5159691Z 2025-05-07T19:59:15.5160127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.5160814Z 2025-05-07T19:59:15.5162677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5165253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5166646Z ^ 2025-05-07T19:59:15.5167010Z 2025-05-07T19:59:15.5168726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:15.5171438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5172617Z ^ 2025-05-07T19:59:15.5172872Z 2025-05-07T19:59:15.5173324Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.5174009Z 2025-05-07T19:59:15.5175703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5178308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5179331Z ^ 2025-05-07T19:59:15.5179701Z 2025-05-07T19:59:15.5181361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5184083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5211108Z ^ 2025-05-07T19:59:15.5211646Z 2025-05-07T19:59:15.5212137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.5212798Z 2025-05-07T19:59:15.5216330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.5219004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.5220184Z ^ 2025-05-07T19:59:15.5220865Z 2025-05-07T19:59:17.8945037Z [313/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:17.8968593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.8971281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.8972508Z ^ 2025-05-07T19:59:17.8972780Z 2025-05-07T19:59:17.8973245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.8973934Z 2025-05-07T19:59:17.8975621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.8978541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.8979767Z ^ 2025-05-07T19:59:17.8980147Z 2025-05-07T19:59:17.8981846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.8984754Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.8985981Z ^ 2025-05-07T19:59:17.8986255Z 2025-05-07T19:59:17.8986720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.8987407Z 2025-05-07T19:59:17.8989100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.8991794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.8992983Z ^ 2025-05-07T19:59:17.8993380Z 2025-05-07T19:59:17.8995169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.8997934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.8999121Z ^ 2025-05-07T19:59:17.8999396Z 2025-05-07T19:59:17.8999813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.9000481Z 2025-05-07T19:59:17.9002284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.9004996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.9006243Z ^ 2025-05-07T19:59:17.9006619Z 2025-05-07T19:59:17.9008288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.9010924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.9012127Z ^ 2025-05-07T19:59:17.9012380Z 2025-05-07T19:59:17.9012859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.9013539Z 2025-05-07T19:59:17.9015245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.9018028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.9019180Z ^ 2025-05-07T19:59:17.9019584Z 2025-05-07T19:59:17.9021287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.9024166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.9025344Z ^ 2025-05-07T19:59:17.9025623Z 2025-05-07T19:59:17.9026075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.9026726Z 2025-05-07T19:59:17.9028842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.9031500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.9032712Z ^ 
2025-05-07T19:59:17.9033079Z 2025-05-07T19:59:18.7707612Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:18.7731692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7734449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7735673Z ^ 2025-05-07T19:59:18.7735938Z 2025-05-07T19:59:18.7736399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7737077Z 2025-05-07T19:59:18.7739072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7741780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7743139Z ^ 2025-05-07T19:59:18.7743503Z 2025-05-07T19:59:18.7745173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:18.7747861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7749078Z ^ 2025-05-07T19:59:18.7749338Z 2025-05-07T19:59:18.7749832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7750506Z 2025-05-07T19:59:18.7752208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7755105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7756345Z ^ 2025-05-07T19:59:18.7756727Z 2025-05-07T19:59:18.7758383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7761105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7762469Z ^ 2025-05-07T19:59:18.7762763Z 2025-05-07T19:59:18.7763207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7763894Z 2025-05-07T19:59:18.7765552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7767842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7769003Z ^ 2025-05-07T19:59:18.7769346Z 2025-05-07T19:59:18.7770881Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7773512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7774702Z ^ 2025-05-07T19:59:18.7774974Z 2025-05-07T19:59:18.7775420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7776123Z 2025-05-07T19:59:18.7777790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7780485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7781667Z ^ 2025-05-07T19:59:18.7782063Z 2025-05-07T19:59:18.7783853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7786470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7787659Z ^ 2025-05-07T19:59:18.7787943Z 2025-05-07T19:59:18.7788470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7789142Z 2025-05-07T19:59:18.7790839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7793443Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7794776Z ^ 2025-05-07T19:59:18.7795098Z 2025-05-07T19:59:25.8073326Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:25.8097573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8100714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8101953Z ^ 2025-05-07T19:59:25.8102251Z 2025-05-07T19:59:25.8102723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.8103411Z 2025-05-07T19:59:25.8105379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8108097Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8109484Z ^ 2025-05-07T19:59:25.8109857Z 2025-05-07T19:59:25.8111538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8114537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8115718Z ^ 2025-05-07T19:59:25.8115987Z 2025-05-07T19:59:25.8116443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.8117143Z 2025-05-07T19:59:25.8118816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8121518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8122722Z ^ 2025-05-07T19:59:25.8123114Z 2025-05-07T19:59:25.8124743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8127565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8129016Z ^ 2025-05-07T19:59:25.8129321Z 2025-05-07T19:59:25.8129793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.8130457Z 2025-05-07T19:59:25.8132096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8134781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8136021Z ^ 2025-05-07T19:59:25.8136393Z 2025-05-07T19:59:25.8138072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8140668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8141866Z ^ 2025-05-07T19:59:25.8142121Z 2025-05-07T19:59:25.8142591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:25.8143263Z 2025-05-07T19:59:25.8144921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8147835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8149025Z ^ 2025-05-07T19:59:25.8149393Z 2025-05-07T19:59:25.8151023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8153901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8155224Z ^ 2025-05-07T19:59:25.8155503Z 2025-05-07T19:59:25.8155948Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:25.8156615Z 2025-05-07T19:59:25.8158303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:25.8160966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:25.8162181Z ^ 2025-05-07T19:59:25.8162517Z 2025-05-07T19:59:26.3480530Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:26.3502444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3504642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3505814Z ^ 2025-05-07T19:59:26.3506042Z 2025-05-07T19:59:26.3506618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.3507218Z 2025-05-07T19:59:26.3508771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3511257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3512440Z ^ 2025-05-07T19:59:26.3512786Z 2025-05-07T19:59:26.3514449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3516900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3518016Z ^ 2025-05-07T19:59:26.3518267Z 2025-05-07T19:59:26.3518723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.3519362Z 2025-05-07T19:59:26.3520896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3523544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3524635Z ^ 2025-05-07T19:59:26.3524983Z 2025-05-07T19:59:26.3526482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3529121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3530213Z ^ 2025-05-07T19:59:26.3530482Z 2025-05-07T19:59:26.3530913Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.3531477Z 2025-05-07T19:59:26.3532930Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3535227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3536476Z ^ 2025-05-07T19:59:26.3536870Z 2025-05-07T19:59:26.3538616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3541202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3542160Z ^ 2025-05-07T19:59:26.3542380Z 2025-05-07T19:59:26.3545346Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.3546135Z 2025-05-07T19:59:26.3547498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3550307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3551736Z ^ 2025-05-07T19:59:26.3552160Z 2025-05-07T19:59:26.3553841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3556436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3557679Z ^ 
2025-05-07T19:59:26.3557990Z 2025-05-07T19:59:26.3558466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.3559164Z 2025-05-07T19:59:26.3560646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.3563419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.3564576Z ^ 2025-05-07T19:59:26.3565013Z 2025-05-07T19:59:32.0116542Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:32.0139804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0142415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0143409Z ^ 2025-05-07T19:59:32.0143750Z 2025-05-07T19:59:32.0144163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.0144758Z 2025-05-07T19:59:32.0146297Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0148682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0149817Z ^ 2025-05-07T19:59:32.0150186Z 2025-05-07T19:59:32.0151723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0154334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0155448Z ^ 2025-05-07T19:59:32.0155699Z 2025-05-07T19:59:32.0156494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.0157104Z 2025-05-07T19:59:32.0158598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0161135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0162176Z ^ 2025-05-07T19:59:32.0162565Z 2025-05-07T19:59:32.0164037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0166508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0167586Z ^ 
2025-05-07T19:59:32.0167850Z 2025-05-07T19:59:32.0168268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.0168893Z 2025-05-07T19:59:32.0170498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0173072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0174163Z ^ 2025-05-07T19:59:32.0174511Z 2025-05-07T19:59:32.0176330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0178864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0179950Z ^ 2025-05-07T19:59:32.0180178Z 2025-05-07T19:59:32.0180741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.0181375Z 2025-05-07T19:59:32.0183014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0185494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0186550Z ^ 2025-05-07T19:59:32.0186900Z 2025-05-07T19:59:32.0188443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:32.0190957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0192103Z ^ 2025-05-07T19:59:32.0192355Z 2025-05-07T19:59:32.0192768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.0193377Z 2025-05-07T19:59:32.0195123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.0197592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.0198917Z ^ 2025-05-07T19:59:32.0199269Z 2025-05-07T19:59:32.6375624Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:32.6398412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6401106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6402272Z ^ 2025-05-07T19:59:32.6402538Z 
2025-05-07T19:59:32.6402983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.6403643Z 2025-05-07T19:59:32.6405213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6407762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6408884Z ^ 2025-05-07T19:59:32.6409206Z 2025-05-07T19:59:32.6410515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6412868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6414026Z ^ 2025-05-07T19:59:32.6414261Z 2025-05-07T19:59:32.6414641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.6415198Z 2025-05-07T19:59:32.6416515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6418600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6419575Z ^ 2025-05-07T19:59:32.6419867Z 2025-05-07T19:59:32.6421158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6423521Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6424433Z ^ 2025-05-07T19:59:32.6424674Z 2025-05-07T19:59:32.6425043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.6425556Z 2025-05-07T19:59:32.6426923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6429728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6430919Z ^ 2025-05-07T19:59:32.6431256Z 2025-05-07T19:59:32.6432776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6435724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6436880Z ^ 2025-05-07T19:59:32.6437135Z 2025-05-07T19:59:32.6437548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.6438186Z 2025-05-07T19:59:32.6439742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6442346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6443459Z ^ 2025-05-07T19:59:32.6443833Z 2025-05-07T19:59:32.6445379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6447956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6449092Z ^ 2025-05-07T19:59:32.6449390Z 2025-05-07T19:59:32.6449815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.6450459Z 2025-05-07T19:59:32.6452342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.6455161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.6456388Z ^ 2025-05-07T19:59:32.6456758Z 2025-05-07T19:59:36.6678163Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:36.6702438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:36.6705088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6706253Z ^ 2025-05-07T19:59:36.6706498Z 2025-05-07T19:59:36.6706936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.6707610Z 2025-05-07T19:59:36.6709255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6711914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6713237Z ^ 2025-05-07T19:59:36.6713604Z 2025-05-07T19:59:36.6715349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6717989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6719133Z ^ 2025-05-07T19:59:36.6719390Z 2025-05-07T19:59:36.6719823Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.6720435Z 2025-05-07T19:59:36.6722482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6725193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6726404Z ^ 2025-05-07T19:59:36.6726770Z 2025-05-07T19:59:36.6728694Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6731377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6732567Z ^ 2025-05-07T19:59:36.6732825Z 2025-05-07T19:59:36.6733281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.6733973Z 2025-05-07T19:59:36.6735964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6738612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6739861Z ^ 2025-05-07T19:59:36.6740227Z 2025-05-07T19:59:36.6741961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6744584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6745726Z ^ 2025-05-07T19:59:36.6745981Z 2025-05-07T19:59:36.6746416Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.6747063Z 2025-05-07T19:59:36.6748712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6751345Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6752522Z ^ 2025-05-07T19:59:36.6752877Z 2025-05-07T19:59:36.6754606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6757212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6758519Z ^ 2025-05-07T19:59:36.6758770Z 2025-05-07T19:59:36.6759214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:36.6759891Z 2025-05-07T19:59:36.6761554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.6764252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:36.6765422Z ^ 2025-05-07T19:59:36.6765809Z 2025-05-07T19:59:46.0820374Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:46.0844816Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0847507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0848726Z ^ 2025-05-07T19:59:46.0848994Z 2025-05-07T19:59:46.0849458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.0850321Z 2025-05-07T19:59:46.0852001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0854995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0856235Z ^ 2025-05-07T19:59:46.0856631Z 2025-05-07T19:59:46.0858330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0861034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0862184Z ^ 2025-05-07T19:59:46.0862435Z 2025-05-07T19:59:46.0862909Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.0863752Z 2025-05-07T19:59:46.0865450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0868198Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0869447Z ^ 2025-05-07T19:59:46.0869820Z 2025-05-07T19:59:46.0871510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0874295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0875679Z ^ 2025-05-07T19:59:46.0875939Z 2025-05-07T19:59:46.0876390Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.0877073Z 2025-05-07T19:59:46.0878822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0881797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0882997Z ^ 2025-05-07T19:59:46.0883365Z 2025-05-07T19:59:46.0884902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0887587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0888800Z ^ 2025-05-07T19:59:46.0889062Z 2025-05-07T19:59:46.0889542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.0890227Z 2025-05-07T19:59:46.0891918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0894619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0895853Z ^ 2025-05-07T19:59:46.0896226Z 2025-05-07T19:59:46.0897915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0900765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0901986Z ^ 2025-05-07T19:59:46.0902274Z 2025-05-07T19:59:46.0902735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.0903428Z 2025-05-07T19:59:46.0905349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.0908139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.0909409Z ^ 2025-05-07T19:59:46.0909793Z 2025-05-07T19:59:56.7293339Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:56.7315574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7318230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7319648Z ^ 2025-05-07T19:59:56.7319893Z 2025-05-07T19:59:56.7320300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.7320920Z 2025-05-07T19:59:56.7322363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7324514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7325578Z ^ 2025-05-07T19:59:56.7325960Z 2025-05-07T19:59:56.7327425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7329718Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.7330408Z ^ 2025-05-07T19:59:56.7330696Z 2025-05-07T19:59:56.7332173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless 
comparison of unsigned integer with zero 2025-05-07T19:59:56.7334072Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7334627Z ^ 2025-05-07T19:59:56.7334912Z 2025-05-07T19:59:56.7336413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7338273Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7339070Z ^ 2025-05-07T19:59:56.7339362Z 2025-05-07T19:59:56.7340822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7342806Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7343380Z ^ 2025-05-07T19:59:56.7343641Z 2025-05-07T19:59:56.7345345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7347849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7348971Z ^ 2025-05-07T19:59:56.7349201Z 2025-05-07T19:59:56.7349617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.7350224Z 2025-05-07T19:59:56.7351725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7354433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7355688Z ^ 
2025-05-07T19:59:56.7356017Z 2025-05-07T19:59:56.7357401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7359167Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.7359874Z ^ 2025-05-07T19:59:56.7360317Z 2025-05-07T19:59:56.7361986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7363785Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7364359Z ^ 2025-05-07T19:59:56.7364626Z 2025-05-07T19:59:56.7366074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7367946Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7368519Z ^ 2025-05-07T19:59:56.7368775Z 2025-05-07T19:59:56.7370242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7372097Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7372662Z ^ 2025-05-07T19:59:56.7372922Z 2025-05-07T19:59:56.7374457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7376963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() 
= default; 2025-05-07T19:59:56.7378016Z ^ 2025-05-07T19:59:56.7378280Z 2025-05-07T19:59:56.7378718Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:59:56.7379324Z 2025-05-07T19:59:56.7380987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7383185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7384273Z ^ 2025-05-07T19:59:56.7384550Z 2025-05-07T19:59:56.7385869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7387867Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.7388628Z ^ 2025-05-07T19:59:56.7388910Z 2025-05-07T19:59:56.7390325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7392070Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7392622Z ^ 2025-05-07T19:59:56.7392900Z 2025-05-07T19:59:56.7394474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7396334Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7396860Z ^ 2025-05-07T19:59:56.7397138Z 2025-05-07T19:59:56.7398489Z
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7400160Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7400742Z ^ 2025-05-07T19:59:56.7400978Z 2025-05-07T19:59:56.7402313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7404730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7405842Z ^ 2025-05-07T19:59:56.7406089Z 2025-05-07T19:59:56.7406520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.7407153Z 2025-05-07T19:59:56.7408702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7411260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7412428Z ^ 2025-05-07T19:59:56.7412789Z 2025-05-07T19:59:56.7414262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7416316Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.7417065Z ^ 2025-05-07T19:59:56.7417344Z 2025-05-07T19:59:56.7418795Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7420664Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7421341Z ^ 2025-05-07T19:59:56.7421666Z 2025-05-07T19:59:56.7423109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7424958Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7425479Z ^ 2025-05-07T19:59:56.7425780Z 2025-05-07T19:59:56.7427311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7429283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7429838Z ^ 2025-05-07T19:59:56.7430120Z 2025-05-07T19:59:56.7431745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:56.7434293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7435375Z ^ 2025-05-07T19:59:56.7435585Z 2025-05-07T19:59:56.7435946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.7436515Z 2025-05-07T19:59:56.7437975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:56.7440486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:56.7441575Z ^ 2025-05-07T19:59:56.7442225Z 2025-05-07T19:59:56.7443675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7445734Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.7446460Z ^ 2025-05-07T19:59:56.7446760Z 2025-05-07T19:59:56.7448251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7450098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7450629Z ^ 2025-05-07T19:59:56.7450903Z 2025-05-07T19:59:56.7452351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7454217Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7454729Z ^ 2025-05-07T19:59:56.7454988Z 2025-05-07T19:59:56.7456398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.7458191Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.7458719Z ^ 2025-05-07T19:59:56.7458969Z 2025-05-07T20:00:07.2118082Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:07.2140233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2142935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2144090Z ^ 2025-05-07T20:00:07.2144347Z 2025-05-07T20:00:07.2144787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.2145439Z 2025-05-07T20:00:07.2147072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2149832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2150904Z ^ 2025-05-07T20:00:07.2151268Z 2025-05-07T20:00:07.2152817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2155646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2156751Z ^ 2025-05-07T20:00:07.2157132Z 
2025-05-07T20:00:07.2157559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.2158140Z 2025-05-07T20:00:07.2160027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2162663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2163983Z ^ 2025-05-07T20:00:07.2164310Z 2025-05-07T20:00:07.2165934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2168598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2169670Z ^ 2025-05-07T20:00:07.2169928Z 2025-05-07T20:00:07.2170314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.2171013Z 2025-05-07T20:00:07.2172420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2174854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2175850Z ^ 2025-05-07T20:00:07.2176195Z 2025-05-07T20:00:07.2177708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2180390Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2181856Z ^ 2025-05-07T20:00:07.2182095Z 2025-05-07T20:00:07.2182577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.2183235Z 2025-05-07T20:00:07.2184878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2187668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2188775Z ^ 2025-05-07T20:00:07.2189145Z 2025-05-07T20:00:07.2190744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2193375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2194624Z ^ 2025-05-07T20:00:07.2194922Z 2025-05-07T20:00:07.2195355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.2195962Z 2025-05-07T20:00:07.2197661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.2200290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.2201418Z ^ 2025-05-07T20:00:07.2201781Z 2025-05-07T20:00:07.4102017Z [323/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:07.4122235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4124553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4125577Z ^ 2025-05-07T20:00:07.4125792Z 2025-05-07T20:00:07.4126169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.4126736Z 2025-05-07T20:00:07.4128056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4130520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4131709Z ^ 2025-05-07T20:00:07.4132010Z 2025-05-07T20:00:07.4133254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4135401Z if (output_d >= 0 && 
output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:07.4136105Z ^ 2025-05-07T20:00:07.4136334Z 2025-05-07T20:00:07.4137579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4139392Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4140025Z ^ 2025-05-07T20:00:07.4140270Z 2025-05-07T20:00:07.4141549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4143179Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4143686Z ^ 2025-05-07T20:00:07.4143935Z 2025-05-07T20:00:07.4145231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4146969Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4147518Z ^ 2025-05-07T20:00:07.4147760Z 2025-05-07T20:00:07.4149213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4151492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4152755Z ^ 2025-05-07T20:00:07.4153039Z 2025-05-07T20:00:07.4153422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.4154094Z 2025-05-07T20:00:07.4155652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4157929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4158954Z ^ 2025-05-07T20:00:07.4159250Z 2025-05-07T20:00:07.4160601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4162391Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:07.4163039Z ^ 2025-05-07T20:00:07.4163295Z 2025-05-07T20:00:07.4164590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4166391Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4166857Z ^ 2025-05-07T20:00:07.4167106Z 2025-05-07T20:00:07.4168371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4169998Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4170473Z ^ 2025-05-07T20:00:07.4170705Z 2025-05-07T20:00:07.4172227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4173947Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4174411Z ^ 2025-05-07T20:00:07.4174692Z 2025-05-07T20:00:07.4176318Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4178900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4180248Z ^ 2025-05-07T20:00:07.4180523Z 2025-05-07T20:00:07.4180982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.4181659Z 2025-05-07T20:00:07.4183370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4186303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4187505Z ^ 2025-05-07T20:00:07.4187874Z 2025-05-07T20:00:07.4189396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4191510Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:07.4192279Z ^ 2025-05-07T20:00:07.4192571Z 2025-05-07T20:00:07.4194606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4196488Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4196978Z ^ 2025-05-07T20:00:07.4197274Z 2025-05-07T20:00:07.4198802Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4200798Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4201365Z ^ 2025-05-07T20:00:07.4201675Z 2025-05-07T20:00:07.4203229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4205223Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4205785Z ^ 2025-05-07T20:00:07.4206062Z 2025-05-07T20:00:07.4207785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4210523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4211743Z ^ 2025-05-07T20:00:07.4212177Z 2025-05-07T20:00:07.4212666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.4213341Z 2025-05-07T20:00:07.4215065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4217880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4219145Z ^ 2025-05-07T20:00:07.4219501Z 2025-05-07T20:00:07.4220895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4223055Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:07.4223875Z ^ 2025-05-07T20:00:07.4224163Z 2025-05-07T20:00:07.4225600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4227351Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4227855Z ^ 2025-05-07T20:00:07.4228100Z 2025-05-07T20:00:07.4229625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4231357Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4231932Z ^ 2025-05-07T20:00:07.4232214Z 2025-05-07T20:00:07.4233964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4236047Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4236595Z ^ 2025-05-07T20:00:07.4237063Z 2025-05-07T20:00:07.4238752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4241779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4242980Z ^ 2025-05-07T20:00:07.4243269Z 2025-05-07T20:00:07.4243735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:07.4244418Z 2025-05-07T20:00:07.4246142Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.4248790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:07.4249798Z ^ 2025-05-07T20:00:07.4250103Z 2025-05-07T20:00:07.4251614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4253766Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:07.4254723Z ^ 2025-05-07T20:00:07.4255021Z 2025-05-07T20:00:07.4256737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4258680Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4259266Z ^ 2025-05-07T20:00:07.4259546Z 2025-05-07T20:00:07.4261061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4263187Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4263733Z ^ 2025-05-07T20:00:07.4264133Z 2025-05-07T20:00:07.4265694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:07.4267747Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:07.4268357Z ^ 
2025-05-07T20:00:07.4268831Z 2025-05-07T20:00:08.5573097Z [324/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:00:09.9069167Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:09.9086240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9088639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9089699Z ^ 2025-05-07T20:00:09.9089959Z 2025-05-07T20:00:09.9090321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:09.9090868Z 2025-05-07T20:00:09.9092273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9094805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9095955Z ^ 2025-05-07T20:00:09.9096310Z 2025-05-07T20:00:09.9098006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9100608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9101526Z ^ 2025-05-07T20:00:09.9101738Z 2025-05-07T20:00:09.9102078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:09.9102597Z 2025-05-07T20:00:09.9103816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9105928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9106824Z ^ 
2025-05-07T20:00:09.9107144Z 2025-05-07T20:00:09.9108336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9110290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9111159Z ^ 2025-05-07T20:00:09.9111394Z 2025-05-07T20:00:09.9111736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:09.9112247Z 2025-05-07T20:00:09.9113515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9115617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9116558Z ^ 2025-05-07T20:00:09.9116855Z 2025-05-07T20:00:09.9118028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9119990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9120899Z ^ 2025-05-07T20:00:09.9121227Z 2025-05-07T20:00:09.9121595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:09.9122125Z 2025-05-07T20:00:09.9123623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:09.9126059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9126980Z ^ 2025-05-07T20:00:09.9127294Z 2025-05-07T20:00:09.9128823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9130891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9131789Z ^ 2025-05-07T20:00:09.9132005Z 2025-05-07T20:00:09.9132372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:09.9132879Z 2025-05-07T20:00:09.9134114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.9136113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:09.9137075Z ^ 2025-05-07T20:00:09.9137373Z 2025-05-07T20:00:24.2808849Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:24.2832697Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2835499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2836629Z ^ 2025-05-07T20:00:24.2836888Z 2025-05-07T20:00:24.2837357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.2838156Z 2025-05-07T20:00:24.2839864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2842794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2844076Z ^ 2025-05-07T20:00:24.2844462Z 2025-05-07T20:00:24.2846123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2848677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2849876Z ^ 2025-05-07T20:00:24.2850405Z 2025-05-07T20:00:24.2850828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.2851467Z 2025-05-07T20:00:24.2853107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2855778Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2856911Z ^ 2025-05-07T20:00:24.2857254Z 2025-05-07T20:00:24.2858730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2861299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2862412Z ^ 2025-05-07T20:00:24.2862648Z 2025-05-07T20:00:24.2863092Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.2863732Z 2025-05-07T20:00:24.2865220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2867673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2868794Z ^ 2025-05-07T20:00:24.2869132Z 2025-05-07T20:00:24.2870991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2873346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2874544Z ^ 2025-05-07T20:00:24.2874952Z 2025-05-07T20:00:24.2875371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.2875996Z 2025-05-07T20:00:24.2877638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2880126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2881301Z ^ 2025-05-07T20:00:24.2881618Z 2025-05-07T20:00:24.2883208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2885642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2886698Z ^ 2025-05-07T20:00:24.2886913Z 2025-05-07T20:00:24.2887330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.2888117Z 2025-05-07T20:00:24.2889969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.2892818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.2894200Z ^ 2025-05-07T20:00:24.2894588Z 2025-05-07T20:00:26.9895679Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:26.9920792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9923738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9924981Z ^ 2025-05-07T20:00:26.9925253Z 2025-05-07T20:00:26.9925700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:26.9926368Z 2025-05-07T20:00:26.9928141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9931150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9932314Z ^ 2025-05-07T20:00:26.9932650Z 2025-05-07T20:00:26.9934170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9936850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9937995Z ^ 2025-05-07T20:00:26.9938252Z 2025-05-07T20:00:26.9938720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:26.9939400Z 2025-05-07T20:00:26.9941090Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9943764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9944853Z ^ 2025-05-07T20:00:26.9945262Z 2025-05-07T20:00:26.9946972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9949505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9950657Z ^ 2025-05-07T20:00:26.9950912Z 2025-05-07T20:00:26.9951352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:26.9951925Z 2025-05-07T20:00:26.9953614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9956747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9958027Z ^ 2025-05-07T20:00:26.9958412Z 2025-05-07T20:00:26.9960136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9963293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9964455Z ^ 
2025-05-07T20:00:26.9964711Z 2025-05-07T20:00:26.9965166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:26.9965872Z 2025-05-07T20:00:26.9967543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9970277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9971444Z ^ 2025-05-07T20:00:26.9971850Z 2025-05-07T20:00:26.9973814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9976401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9977485Z ^ 2025-05-07T20:00:26.9977751Z 2025-05-07T20:00:26.9978188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:26.9978839Z 2025-05-07T20:00:26.9980502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9983421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:26.9984685Z ^ 2025-05-07T20:00:26.9985054Z 2025-05-07T20:00:29.5760603Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:29.5781103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5783472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5784542Z ^ 2025-05-07T20:00:29.5784788Z 2025-05-07T20:00:29.5785179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.5785752Z 2025-05-07T20:00:29.5787198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5789530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5790814Z ^ 2025-05-07T20:00:29.5791137Z 2025-05-07T20:00:29.5792616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5795299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5796262Z ^ 2025-05-07T20:00:29.5796471Z 2025-05-07T20:00:29.5796838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.5797399Z 2025-05-07T20:00:29.5798758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5800963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5801928Z ^ 2025-05-07T20:00:29.5802247Z 2025-05-07T20:00:29.5803647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5805840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5806796Z ^ 2025-05-07T20:00:29.5807031Z 2025-05-07T20:00:29.5807448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.5808016Z 2025-05-07T20:00:29.5809558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5811760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5812842Z ^ 2025-05-07T20:00:29.5813272Z 2025-05-07T20:00:29.5814592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5816805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5817908Z ^ 2025-05-07T20:00:29.5818222Z 2025-05-07T20:00:29.5818717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.5819273Z 2025-05-07T20:00:29.5820678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5823004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5824069Z ^ 2025-05-07T20:00:29.5824387Z 2025-05-07T20:00:29.5825779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5827951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5829429Z ^ 2025-05-07T20:00:29.5829647Z 2025-05-07T20:00:29.5830032Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:29.5830598Z 2025-05-07T20:00:29.5832052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.5834488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:29.5835525Z ^ 2025-05-07T20:00:29.5835845Z 2025-05-07T20:00:34.9612225Z 
[329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:34.9634534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9637142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9638243Z ^ 2025-05-07T20:00:34.9638492Z 2025-05-07T20:00:34.9638935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.9639825Z 2025-05-07T20:00:34.9641383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9643853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9645040Z ^ 2025-05-07T20:00:34.9645397Z 2025-05-07T20:00:34.9646968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:34.9649377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9650442Z ^ 2025-05-07T20:00:34.9650674Z 2025-05-07T20:00:34.9651089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.9651658Z 2025-05-07T20:00:34.9653140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9655589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9656690Z ^ 2025-05-07T20:00:34.9657090Z 2025-05-07T20:00:34.9658788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9661692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9662731Z ^ 2025-05-07T20:00:34.9663004Z 2025-05-07T20:00:34.9663460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.9664280Z 2025-05-07T20:00:34.9666117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9668324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9669307Z ^ 2025-05-07T20:00:34.9669633Z 2025-05-07T20:00:34.9671206Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9674022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9675193Z ^ 2025-05-07T20:00:34.9675463Z 2025-05-07T20:00:34.9675904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.9676578Z 2025-05-07T20:00:34.9678258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9680958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9682137Z ^ 2025-05-07T20:00:34.9682531Z 2025-05-07T20:00:34.9684364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9687038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9688212Z ^ 2025-05-07T20:00:34.9688490Z 2025-05-07T20:00:34.9688946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:34.9689612Z 2025-05-07T20:00:34.9691274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.9693957Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:34.9695164Z ^ 2025-05-07T20:00:34.9695534Z 2025-05-07T20:00:36.9621335Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:36.9644507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9647173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9648586Z ^ 2025-05-07T20:00:36.9648867Z 2025-05-07T20:00:36.9649316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.9649982Z 2025-05-07T20:00:36.9651620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9654329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9655366Z ^ 2025-05-07T20:00:36.9655666Z 2025-05-07T20:00:36.9656926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9658984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9660002Z ^ 2025-05-07T20:00:36.9660227Z 2025-05-07T20:00:36.9660626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.9661228Z 2025-05-07T20:00:36.9662936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9665457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9666530Z ^ 2025-05-07T20:00:36.9666865Z 2025-05-07T20:00:36.9668747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9671456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9672728Z ^ 2025-05-07T20:00:36.9672972Z 2025-05-07T20:00:36.9673505Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.9674276Z 2025-05-07T20:00:36.9675753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9678157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9679398Z ^ 2025-05-07T20:00:36.9679767Z 2025-05-07T20:00:36.9681439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9684198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9685400Z ^ 2025-05-07T20:00:36.9685685Z 2025-05-07T20:00:36.9686142Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.9686821Z 2025-05-07T20:00:36.9688557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9691409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9692641Z ^ 2025-05-07T20:00:36.9693022Z 2025-05-07T20:00:36.9694764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9697228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9698402Z ^ 2025-05-07T20:00:36.9698655Z 2025-05-07T20:00:36.9699120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.9699771Z 2025-05-07T20:00:36.9701435Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.9704173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.9705384Z ^ 2025-05-07T20:00:36.9705774Z 2025-05-07T20:00:39.4872870Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:00:39.4896773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4898944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4900069Z ^ 2025-05-07T20:00:39.4900340Z 2025-05-07T20:00:39.4900794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:39.4901483Z 2025-05-07T20:00:39.4903243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:39.4905926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4907113Z ^ 2025-05-07T20:00:39.4907496Z 2025-05-07T20:00:39.4909118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4911755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4930950Z ^ 2025-05-07T20:00:39.4931231Z 2025-05-07T20:00:39.4931668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:39.4932268Z 2025-05-07T20:00:39.4934243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4936477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4937603Z ^ 2025-05-07T20:00:39.4938149Z 2025-05-07T20:00:39.4939907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4942589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4943767Z ^ 2025-05-07T20:00:39.4944036Z 2025-05-07T20:00:39.4944497Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:39.4945159Z 2025-05-07T20:00:39.4946821Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4949533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4950774Z ^ 2025-05-07T20:00:39.4951134Z 2025-05-07T20:00:39.4952806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4955680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4956863Z ^ 2025-05-07T20:00:39.4957121Z 2025-05-07T20:00:39.4957573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:39.4958468Z 2025-05-07T20:00:39.4960129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4962866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4964057Z ^ 2025-05-07T20:00:39.4964432Z 2025-05-07T20:00:39.4966074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4968790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4969965Z ^ 
2025-05-07T20:00:39.4970236Z 2025-05-07T20:00:39.4970672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:39.4971327Z 2025-05-07T20:00:39.4973049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.4975734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:39.4976897Z ^ 2025-05-07T20:00:39.4977253Z 2025-05-07T20:00:40.1104070Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:40.1116279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1117679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1118296Z ^ 2025-05-07T20:00:40.1118457Z 2025-05-07T20:00:40.1118701Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:40.1119054Z 2025-05-07T20:00:40.1119929Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1121316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1121956Z ^ 2025-05-07T20:00:40.1122150Z 2025-05-07T20:00:40.1123068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1124440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1125068Z ^ 2025-05-07T20:00:40.1125209Z 2025-05-07T20:00:40.1125518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:40.1125871Z 2025-05-07T20:00:40.1126729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1128182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1129050Z ^ 2025-05-07T20:00:40.1129262Z 2025-05-07T20:00:40.1130114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1131502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1132116Z ^ 
2025-05-07T20:00:40.1132271Z 2025-05-07T20:00:40.1132513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:40.1132862Z 2025-05-07T20:00:40.1133737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1135124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1135767Z ^ 2025-05-07T20:00:40.1135961Z 2025-05-07T20:00:40.1136831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1138295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1138932Z ^ 2025-05-07T20:00:40.1139074Z 2025-05-07T20:00:40.1139314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:40.1139682Z 2025-05-07T20:00:40.1140544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1141940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1142566Z ^ 2025-05-07T20:00:40.1142780Z 2025-05-07T20:00:40.1143632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:40.1145021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1145644Z ^ 2025-05-07T20:00:40.1145782Z 2025-05-07T20:00:40.1146038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:40.1146388Z 2025-05-07T20:00:40.1147249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:40.1148641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:40.1149349Z ^ 2025-05-07T20:00:40.1149549Z 2025-05-07T20:00:44.9541499Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:44.9564211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9566574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9567592Z ^ 2025-05-07T20:00:44.9567825Z 
2025-05-07T20:00:44.9568229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:44.9568833Z 2025-05-07T20:00:44.9570253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9572609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9573598Z ^ 2025-05-07T20:00:44.9573920Z 2025-05-07T20:00:44.9575671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9577894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9579018Z ^ 2025-05-07T20:00:44.9579262Z 2025-05-07T20:00:44.9579823Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:44.9580434Z 2025-05-07T20:00:44.9582047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9584446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9585430Z ^ 2025-05-07T20:00:44.9585772Z 2025-05-07T20:00:44.9587289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9589798Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9591029Z ^ 2025-05-07T20:00:44.9591294Z 2025-05-07T20:00:44.9591758Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:44.9592390Z 2025-05-07T20:00:44.9594103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9596352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9597533Z ^ 2025-05-07T20:00:44.9597931Z 2025-05-07T20:00:44.9599472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9601766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9602700Z ^ 2025-05-07T20:00:44.9602937Z 2025-05-07T20:00:44.9603301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:44.9603816Z 2025-05-07T20:00:44.9605249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9607835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9608825Z ^ 2025-05-07T20:00:44.9609131Z 2025-05-07T20:00:44.9610633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9613083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9614207Z ^ 2025-05-07T20:00:44.9614451Z 2025-05-07T20:00:44.9614878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:44.9615532Z 2025-05-07T20:00:44.9617274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:44.9619824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:44.9620978Z ^ 2025-05-07T20:00:44.9621355Z 2025-05-07T20:00:46.7250707Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:00:46.7274500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7277267Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7278474Z ^ 2025-05-07T20:00:46.7278744Z 2025-05-07T20:00:46.7279266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:46.7279956Z 2025-05-07T20:00:46.7281607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7284338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7285897Z ^ 2025-05-07T20:00:46.7286279Z 2025-05-07T20:00:46.7287939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7290795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7292067Z ^ 2025-05-07T20:00:46.7292334Z 2025-05-07T20:00:46.7292801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:46.7293506Z 2025-05-07T20:00:46.7295181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7297906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7299138Z ^ 2025-05-07T20:00:46.7299513Z 2025-05-07T20:00:46.7301197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7303869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7305098Z ^ 2025-05-07T20:00:46.7305354Z 2025-05-07T20:00:46.7305830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:46.7306500Z 2025-05-07T20:00:46.7308178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7311023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7312254Z ^ 2025-05-07T20:00:46.7312640Z 2025-05-07T20:00:46.7314435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7317140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7318324Z ^ 2025-05-07T20:00:46.7318585Z 2025-05-07T20:00:46.7319062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:46.7319736Z 2025-05-07T20:00:46.7321416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7324109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:46.7325338Z ^ 2025-05-07T20:00:46.7325710Z 2025-05-07T20:00:46.7327354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7330236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7331432Z ^ 2025-05-07T20:00:46.7331692Z 2025-05-07T20:00:46.7332435Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:46.7333113Z 2025-05-07T20:00:46.7334828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:46.7341021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:46.7342237Z ^ 2025-05-07T20:00:46.7342622Z 2025-05-07T20:00:51.0546432Z [335/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:00:51.0567347Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.6494083Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:00:53.6517357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6520074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6521569Z ^ 2025-05-07T20:00:53.6521864Z 2025-05-07T20:00:53.6522294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:53.6522939Z 2025-05-07T20:00:53.6524587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6527326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6528863Z ^ 2025-05-07T20:00:53.6529232Z 2025-05-07T20:00:53.6531074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6533708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6534869Z ^ 2025-05-07T20:00:53.6535127Z 2025-05-07T20:00:53.6535590Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:00:53.6536155Z 2025-05-07T20:00:53.6537726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6540481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6541577Z ^ 2025-05-07T20:00:53.6541931Z 2025-05-07T20:00:53.6543741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6546250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6547514Z ^ 2025-05-07T20:00:53.6547808Z 2025-05-07T20:00:53.6548239Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:53.6548980Z 2025-05-07T20:00:53.6550628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6553138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6554552Z ^ 2025-05-07T20:00:53.6554940Z 2025-05-07T20:00:53.6556643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6559431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6560653Z ^ 2025-05-07T20:00:53.6560912Z 2025-05-07T20:00:53.6561359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:53.6562001Z 2025-05-07T20:00:53.6563589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6566327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6567715Z ^ 2025-05-07T20:00:53.6568117Z 2025-05-07T20:00:53.6569861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6572660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6573709Z ^ 2025-05-07T20:00:53.6573971Z 2025-05-07T20:00:53.6574467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:53.6575145Z 2025-05-07T20:00:53.6576826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.6579595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:53.6580784Z ^ 2025-05-07T20:00:53.6581090Z 2025-05-07T20:00:54.5490073Z [337/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:00:54.5509923Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.0855569Z [338/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:56.0877811Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:01.8773264Z [339/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:01.8793621Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:02.8903079Z [340/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:02.8924758Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.3068453Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:03.3093326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3095988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3097506Z ^ 2025-05-07T20:01:03.3097782Z 2025-05-07T20:01:03.3098282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.3098981Z 2025-05-07T20:01:03.3100742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3103767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3105005Z ^ 2025-05-07T20:01:03.3105413Z 2025-05-07T20:01:03.3107150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3109963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3111174Z ^ 2025-05-07T20:01:03.3111467Z 2025-05-07T20:01:03.3111931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.3112627Z 2025-05-07T20:01:03.3114440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3117037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3118279Z ^ 2025-05-07T20:01:03.3118654Z 2025-05-07T20:01:03.3120333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3122915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3124088Z ^ 2025-05-07T20:01:03.3124369Z 2025-05-07T20:01:03.3124826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.3125491Z 2025-05-07T20:01:03.3127197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3130158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3131397Z ^ 2025-05-07T20:01:03.3131780Z 2025-05-07T20:01:03.3133707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3136374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3137577Z ^ 2025-05-07T20:01:03.3137871Z 2025-05-07T20:01:03.3138340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.3139035Z 2025-05-07T20:01:03.3140643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3143424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3144669Z ^ 2025-05-07T20:01:03.3145025Z 2025-05-07T20:01:03.3146648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3149816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3151006Z ^ 2025-05-07T20:01:03.3151274Z 2025-05-07T20:01:03.3151733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:03.3152441Z 2025-05-07T20:01:03.3154461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.3157126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:03.3158229Z ^ 
2025-05-07T20:01:03.3158554Z 2025-05-07T20:01:03.8048367Z [342/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:03.8068710Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.1276763Z [343/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:05.1296702Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.3112783Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:05.3136688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3139633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3140786Z ^ 2025-05-07T20:01:05.3141025Z 2025-05-07T20:01:05.3141468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.3142131Z 2025-05-07T20:01:05.3143830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3146567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3147778Z ^ 2025-05-07T20:01:05.3148149Z 2025-05-07T20:01:05.3149950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3152933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3154205Z ^ 2025-05-07T20:01:05.3154457Z 2025-05-07T20:01:05.3154904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.3155569Z 2025-05-07T20:01:05.3157172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3159639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:05.3160762Z ^ 2025-05-07T20:01:05.3161145Z 2025-05-07T20:01:05.3162717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3165510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3166755Z ^ 2025-05-07T20:01:05.3167030Z 2025-05-07T20:01:05.3167523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.3168199Z 2025-05-07T20:01:05.3169811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3172630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3173832Z ^ 2025-05-07T20:01:05.3174230Z 2025-05-07T20:01:05.3175849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3178852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3180072Z ^ 2025-05-07T20:01:05.3180347Z 2025-05-07T20:01:05.3180771Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.3181448Z 2025-05-07T20:01:05.3183144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:05.3185650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3186741Z ^ 2025-05-07T20:01:05.3187074Z 2025-05-07T20:01:05.3188629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3191252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3192523Z ^ 2025-05-07T20:01:05.3192796Z 2025-05-07T20:01:05.3193229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.3194106Z 2025-05-07T20:01:05.3195865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.3198869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.3200096Z ^ 2025-05-07T20:01:05.3200499Z 2025-05-07T20:01:07.6706002Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:07.6730518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6733181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6734390Z ^ 2025-05-07T20:01:07.6734667Z 2025-05-07T20:01:07.6735105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.6735771Z 2025-05-07T20:01:07.6737389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6740120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6741631Z ^ 2025-05-07T20:01:07.6742019Z 2025-05-07T20:01:07.6743674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6746296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6747403Z ^ 2025-05-07T20:01:07.6747662Z 2025-05-07T20:01:07.6748140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.6748821Z 2025-05-07T20:01:07.6750403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6752929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6754240Z ^ 2025-05-07T20:01:07.6754625Z 2025-05-07T20:01:07.6756139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6758865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6760091Z ^ 2025-05-07T20:01:07.6760364Z 2025-05-07T20:01:07.6760840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.6761565Z 2025-05-07T20:01:07.6763510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6766283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6767543Z ^ 2025-05-07T20:01:07.6767892Z 2025-05-07T20:01:07.6769650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6772308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6773514Z ^ 2025-05-07T20:01:07.6773784Z 2025-05-07T20:01:07.6774282Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:07.6774952Z 2025-05-07T20:01:07.6776557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6779309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6780425Z ^ 2025-05-07T20:01:07.6780793Z 2025-05-07T20:01:07.6782436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6785146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6786453Z ^ 2025-05-07T20:01:07.6786726Z 2025-05-07T20:01:07.6787173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.6787829Z 2025-05-07T20:01:07.6789620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.6792236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.6793416Z ^ 2025-05-07T20:01:07.6793785Z 2025-05-07T20:01:08.7808127Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:08.7833258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7836262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7837432Z ^ 2025-05-07T20:01:08.7837689Z 2025-05-07T20:01:08.7838172Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:08.7839148Z 2025-05-07T20:01:08.7840835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7843564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7844785Z ^ 2025-05-07T20:01:08.7845162Z 2025-05-07T20:01:08.7846818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7849527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7850689Z ^ 2025-05-07T20:01:08.7850983Z 
2025-05-07T20:01:08.7851436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:08.7852113Z 2025-05-07T20:01:08.7853674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7855816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7856817Z ^ 2025-05-07T20:01:08.7857129Z 2025-05-07T20:01:08.7858580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7861320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7862459Z ^ 2025-05-07T20:01:08.7862702Z 2025-05-07T20:01:08.7863134Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:08.7863920Z 2025-05-07T20:01:08.7865557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7867768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7868937Z ^ 2025-05-07T20:01:08.7869341Z 2025-05-07T20:01:08.7871039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7873714Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7875015Z ^ 2025-05-07T20:01:08.7875302Z 2025-05-07T20:01:08.7875749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:08.7876423Z 2025-05-07T20:01:08.7878132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7880799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7881992Z ^ 2025-05-07T20:01:08.7882357Z 2025-05-07T20:01:08.7884048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7886798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7887968Z ^ 2025-05-07T20:01:08.7888273Z 2025-05-07T20:01:08.7888721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:08.7889369Z 2025-05-07T20:01:08.7891083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7893731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:08.7894938Z ^ 2025-05-07T20:01:08.7895294Z 2025-05-07T20:01:10.5537250Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:10.5558506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5561323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5562334Z ^ 2025-05-07T20:01:10.5562557Z 2025-05-07T20:01:10.5562950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.5563542Z 2025-05-07T20:01:10.5564953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5567246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5568264Z ^ 2025-05-07T20:01:10.5568600Z 2025-05-07T20:01:10.5570010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:10.5572291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5573279Z ^ 2025-05-07T20:01:10.5573499Z 2025-05-07T20:01:10.5573906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.5574471Z 2025-05-07T20:01:10.5575904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5578155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5579338Z ^ 2025-05-07T20:01:10.5579651Z 2025-05-07T20:01:10.5581046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5583508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5584625Z ^ 2025-05-07T20:01:10.5584854Z 2025-05-07T20:01:10.5585241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.5585993Z 2025-05-07T20:01:10.5587454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5589784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5590853Z ^ 2025-05-07T20:01:10.5591179Z 2025-05-07T20:01:10.5592655Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5595104Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5596153Z ^ 2025-05-07T20:01:10.5596382Z 2025-05-07T20:01:10.5596793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.5597356Z 2025-05-07T20:01:10.5598788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5601204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5602218Z ^ 2025-05-07T20:01:10.5602563Z 2025-05-07T20:01:10.5603967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5606260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5607261Z ^ 2025-05-07T20:01:10.5607512Z 2025-05-07T20:01:10.5607907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:10.5608489Z 2025-05-07T20:01:10.5609956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.5612278Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:10.5613322Z ^ 2025-05-07T20:01:10.5613654Z 2025-05-07T20:01:11.8406616Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:11.8426756Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.3374703Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:12.3396445Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.9160382Z [350/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:12.9180462Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.9404335Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:12.9425520Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.3804133Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:14.3824925Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:18.5163004Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:18.5187048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5190096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5191298Z ^ 2025-05-07T20:01:18.5191595Z 2025-05-07T20:01:18.5192069Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5192738Z 2025-05-07T20:01:18.5194632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5197367Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5198620Z ^ 2025-05-07T20:01:18.5198992Z 2025-05-07T20:01:18.5200792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5203502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5204720Z ^ 2025-05-07T20:01:18.5204976Z 2025-05-07T20:01:18.5205458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5206141Z 2025-05-07T20:01:18.5207844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5210737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5211975Z ^ 2025-05-07T20:01:18.5212348Z 2025-05-07T20:01:18.5214034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5216914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5218114Z ^ 2025-05-07T20:01:18.5218396Z 2025-05-07T20:01:18.5218856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5219535Z 2025-05-07T20:01:18.5221259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5224002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5225236Z ^ 2025-05-07T20:01:18.5225604Z 2025-05-07T20:01:18.5227315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5230233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5231436Z ^ 2025-05-07T20:01:18.5231691Z 2025-05-07T20:01:18.5232166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5233034Z 2025-05-07T20:01:18.5234821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5237588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5238707Z ^ 2025-05-07T20:01:18.5239080Z 2025-05-07T20:01:18.5240639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5242868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5243859Z ^ 2025-05-07T20:01:18.5244151Z 2025-05-07T20:01:18.5244561Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:01:18.5245189Z 2025-05-07T20:01:18.5246810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5249084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5250195Z ^ 2025-05-07T20:01:18.5250543Z 2025-05-07T20:01:18.5777682Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:01:18.5802420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5805028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5806155Z ^ 2025-05-07T20:01:18.5806386Z 2025-05-07T20:01:18.5806810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5807399Z 2025-05-07T20:01:18.5808897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5811605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5812914Z ^ 2025-05-07T20:01:18.5813264Z 2025-05-07T20:01:18.5814890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5817677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5818828Z ^ 2025-05-07T20:01:18.5819074Z 2025-05-07T20:01:18.5819740Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5820426Z 2025-05-07T20:01:18.5822035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5824755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5825974Z ^ 2025-05-07T20:01:18.5826313Z 2025-05-07T20:01:18.5827850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5830723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5831890Z ^ 2025-05-07T20:01:18.5832143Z 2025-05-07T20:01:18.5832597Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:18.5833247Z 2025-05-07T20:01:18.5835195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5838072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5839271Z ^ 2025-05-07T20:01:18.5839652Z 2025-05-07T20:01:18.5841376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5844188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5845637Z ^ 2025-05-07T20:01:18.5845936Z 2025-05-07T20:01:18.5846402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5847110Z 2025-05-07T20:01:18.5848885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5851682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5852955Z ^ 2025-05-07T20:01:18.5853342Z 2025-05-07T20:01:18.5855177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5858056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5859284Z ^ 2025-05-07T20:01:18.5859543Z 2025-05-07T20:01:18.5860004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5860722Z 2025-05-07T20:01:18.5862418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5865159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5866365Z ^ 2025-05-07T20:01:18.5866770Z 2025-05-07T20:01:19.2145178Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:19.2169649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2171985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2173058Z ^ 2025-05-07T20:01:19.2173297Z 2025-05-07T20:01:19.2173830Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.2174481Z 2025-05-07T20:01:19.2176141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2178853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2180007Z ^ 2025-05-07T20:01:19.2180390Z 2025-05-07T20:01:19.2182430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2185106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2186317Z ^ 2025-05-07T20:01:19.2186604Z 2025-05-07T20:01:19.2187164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.2187836Z 2025-05-07T20:01:19.2189621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2192282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2193467Z ^ 2025-05-07T20:01:19.2193823Z 2025-05-07T20:01:19.2195664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2198240Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2199479Z ^ 2025-05-07T20:01:19.2199753Z 2025-05-07T20:01:19.2200217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.2200899Z 2025-05-07T20:01:19.2202565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2205410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2206802Z ^ 2025-05-07T20:01:19.2207209Z 2025-05-07T20:01:19.2208903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2211413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2212599Z ^ 2025-05-07T20:01:19.2212872Z 2025-05-07T20:01:19.2213330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.2214011Z 2025-05-07T20:01:19.2215731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2218480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2219725Z ^ 2025-05-07T20:01:19.2220092Z 2025-05-07T20:01:19.2221775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2224532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2225734Z ^ 2025-05-07T20:01:19.2226002Z 2025-05-07T20:01:19.2226451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.2227151Z 2025-05-07T20:01:19.2229407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.2232277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.2233654Z ^ 2025-05-07T20:01:19.2234191Z 2025-05-07T20:01:20.1334924Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:20.1358904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1361427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:20.1362573Z ^ 2025-05-07T20:01:20.1362829Z 2025-05-07T20:01:20.1363246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.1363891Z 2025-05-07T20:01:20.1365458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1368028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1369495Z ^ 2025-05-07T20:01:20.1369864Z 2025-05-07T20:01:20.1371293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1373669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1374718Z ^ 2025-05-07T20:01:20.1374987Z 2025-05-07T20:01:20.1375397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.1375968Z 2025-05-07T20:01:20.1377602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1380142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1381307Z ^ 2025-05-07T20:01:20.1381643Z 2025-05-07T20:01:20.1383337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:20.1385649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1386705Z ^ 2025-05-07T20:01:20.1386918Z 2025-05-07T20:01:20.1387344Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.1387941Z 2025-05-07T20:01:20.1389455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1392260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1393404Z ^ 2025-05-07T20:01:20.1393783Z 2025-05-07T20:01:20.1395578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1398128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1399422Z ^ 2025-05-07T20:01:20.1399686Z 2025-05-07T20:01:20.1400153Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.1400808Z 2025-05-07T20:01:20.1402381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1405024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1406251Z ^ 2025-05-07T20:01:20.1406588Z 2025-05-07T20:01:20.1408295Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1410944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1412092Z ^ 2025-05-07T20:01:20.1412406Z 2025-05-07T20:01:20.1413044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.1413821Z 2025-05-07T20:01:20.1415489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.1418486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.1419653Z ^ 2025-05-07T20:01:20.1420033Z 2025-05-07T20:01:20.4500448Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:20.4524170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4526884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4528052Z ^ 2025-05-07T20:01:20.4528573Z 2025-05-07T20:01:20.4529034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.4529701Z 2025-05-07T20:01:20.4531631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4534216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4535572Z ^ 2025-05-07T20:01:20.4535905Z 2025-05-07T20:01:20.4537687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4540146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4541346Z ^ 2025-05-07T20:01:20.4541603Z 2025-05-07T20:01:20.4542044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.4542660Z 2025-05-07T20:01:20.4544174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4546603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4547767Z ^ 2025-05-07T20:01:20.4548128Z 2025-05-07T20:01:20.4549660Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4552281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4553690Z ^ 2025-05-07T20:01:20.4554083Z 2025-05-07T20:01:20.4554515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.4555168Z 2025-05-07T20:01:20.4556786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4559368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4560564Z ^ 2025-05-07T20:01:20.4560918Z 2025-05-07T20:01:20.4562476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4564814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4565990Z ^ 2025-05-07T20:01:20.4566235Z 2025-05-07T20:01:20.4566680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.4567309Z 2025-05-07T20:01:20.4568870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4571531Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4572703Z ^ 2025-05-07T20:01:20.4573102Z 2025-05-07T20:01:20.4574915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4577592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4578775Z ^ 2025-05-07T20:01:20.4579023Z 2025-05-07T20:01:20.4579459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:20.4580187Z 2025-05-07T20:01:20.4581833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.4584396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:20.4585561Z ^ 2025-05-07T20:01:20.4585927Z 2025-05-07T20:01:28.4788062Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:01:28.4809359Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4812002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4813067Z ^ 2025-05-07T20:01:28.4813312Z 2025-05-07T20:01:28.4813742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.4814305Z 2025-05-07T20:01:28.4816110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4818540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4819568Z ^ 2025-05-07T20:01:28.4819917Z 2025-05-07T20:01:28.4821404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4823783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4824807Z ^ 2025-05-07T20:01:28.4825077Z 2025-05-07T20:01:28.4825499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.4826115Z 2025-05-07T20:01:28.4827685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4830364Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4831422Z ^ 2025-05-07T20:01:28.4831752Z 2025-05-07T20:01:28.4833263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4835973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4837076Z ^ 2025-05-07T20:01:28.4837304Z 2025-05-07T20:01:28.4837728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.4838346Z 2025-05-07T20:01:28.4839873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4842262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4843386Z ^ 2025-05-07T20:01:28.4843779Z 2025-05-07T20:01:28.4845271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4847656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4848707Z ^ 2025-05-07T20:01:28.4848953Z 2025-05-07T20:01:28.4849350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.4849813Z 2025-05-07T20:01:28.4850979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4853098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4854112Z ^ 2025-05-07T20:01:28.4854366Z 2025-05-07T20:01:28.4855515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4858142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4859290Z ^ 2025-05-07T20:01:28.4859544Z 2025-05-07T20:01:28.4859992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.4860683Z 2025-05-07T20:01:28.4862060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.4864315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.4865316Z ^ 2025-05-07T20:01:28.4865651Z 2025-05-07T20:01:28.5577760Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:28.5599041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5601650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5602691Z ^ 2025-05-07T20:01:28.5603109Z 2025-05-07T20:01:28.5603506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.5604046Z 2025-05-07T20:01:28.5605500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5607769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5608730Z ^ 2025-05-07T20:01:28.5609045Z 2025-05-07T20:01:28.5610589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5613114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5614209Z ^ 2025-05-07T20:01:28.5614444Z 2025-05-07T20:01:28.5614875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.5615444Z 2025-05-07T20:01:28.5616872Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5619517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5620637Z ^ 2025-05-07T20:01:28.5621034Z 2025-05-07T20:01:28.5622568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5624864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5625840Z ^ 2025-05-07T20:01:28.5626092Z 2025-05-07T20:01:28.5626495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.5627096Z 2025-05-07T20:01:28.5629003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5631530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5632628Z ^ 2025-05-07T20:01:28.5632955Z 2025-05-07T20:01:28.5634485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5636517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5637466Z ^ 
2025-05-07T20:01:28.5637706Z 2025-05-07T20:01:28.5638304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.5638948Z 2025-05-07T20:01:28.5640441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5643226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5644336Z ^ 2025-05-07T20:01:28.5644682Z 2025-05-07T20:01:28.5646163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5648619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5649609Z ^ 2025-05-07T20:01:28.5649872Z 2025-05-07T20:01:28.5650249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:28.5650827Z 2025-05-07T20:01:28.5652269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5654636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:28.5655678Z ^ 2025-05-07T20:01:28.5655991Z 2025-05-07T20:01:30.5250857Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:30.5276016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5278715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5279852Z ^ 2025-05-07T20:01:30.5280114Z 2025-05-07T20:01:30.5280599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.5281272Z 2025-05-07T20:01:30.5282935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5285606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5286664Z ^ 2025-05-07T20:01:30.5287007Z 2025-05-07T20:01:30.5288556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5291101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5292262Z ^ 2025-05-07T20:01:30.5292651Z 2025-05-07T20:01:30.5293069Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.5293710Z 2025-05-07T20:01:30.5295237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5297870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5299008Z ^ 2025-05-07T20:01:30.5299368Z 2025-05-07T20:01:30.5300976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5303296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5304442Z ^ 2025-05-07T20:01:30.5304696Z 2025-05-07T20:01:30.5305146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.5305831Z 2025-05-07T20:01:30.5307530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5310284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5311488Z ^ 2025-05-07T20:01:30.5311865Z 2025-05-07T20:01:30.5313542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5316243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5317421Z ^ 2025-05-07T20:01:30.5317828Z 2025-05-07T20:01:30.5318280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.5318974Z 2025-05-07T20:01:30.5320731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5323266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5324350Z ^ 2025-05-07T20:01:30.5324726Z 2025-05-07T20:01:30.5326249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5328786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5329834Z ^ 2025-05-07T20:01:30.5330107Z 2025-05-07T20:01:30.5330531Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.5331185Z 2025-05-07T20:01:30.5332787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.5335230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.5336561Z ^ 2025-05-07T20:01:30.5336919Z 2025-05-07T20:01:37.6920283Z 
[361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:37.6941110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6943907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6944969Z ^ 2025-05-07T20:01:37.6945242Z 2025-05-07T20:01:37.6945665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.6946270Z 2025-05-07T20:01:37.6947815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6950265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6951365Z ^ 2025-05-07T20:01:37.6951701Z 2025-05-07T20:01:37.6953286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6956163Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6957281Z ^ 2025-05-07T20:01:37.6957531Z 2025-05-07T20:01:37.6957965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.6958588Z 2025-05-07T20:01:37.6960112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6962296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6963271Z ^ 2025-05-07T20:01:37.6963599Z 2025-05-07T20:01:37.6965090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6967383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6968415Z ^ 2025-05-07T20:01:37.6968669Z 2025-05-07T20:01:37.6969085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.6969688Z 2025-05-07T20:01:37.6971206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6973699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6975128Z ^ 2025-05-07T20:01:37.6975494Z 2025-05-07T20:01:37.6977020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6979622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6980850Z ^ 2025-05-07T20:01:37.6981104Z 2025-05-07T20:01:37.6981523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.6982152Z 2025-05-07T20:01:37.6983571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6985935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6987023Z ^ 2025-05-07T20:01:37.6987426Z 2025-05-07T20:01:37.6988784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6991382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6992605Z ^ 2025-05-07T20:01:37.6992818Z 2025-05-07T20:01:37.6993234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.6993813Z 2025-05-07T20:01:37.6995479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.6998060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.6999217Z ^ 
2025-05-07T20:01:37.6999555Z 2025-05-07T20:01:37.7959859Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:37.7980968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.7983269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.7984303Z ^ 2025-05-07T20:01:37.7984622Z 2025-05-07T20:01:37.7985055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.7985630Z 2025-05-07T20:01:37.7987078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.7989533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.7990699Z ^ 2025-05-07T20:01:37.7991318Z 2025-05-07T20:01:37.7992779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:37.7995458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.7996535Z ^ 2025-05-07T20:01:37.7996797Z 2025-05-07T20:01:37.7997173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.7997736Z 2025-05-07T20:01:37.7999204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8001641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8002794Z ^ 2025-05-07T20:01:37.8003124Z 2025-05-07T20:01:37.8004388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8007005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8008199Z ^ 2025-05-07T20:01:37.8008481Z 2025-05-07T20:01:37.8008877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.8009482Z 2025-05-07T20:01:37.8011177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8013694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8014756Z ^ 2025-05-07T20:01:37.8015267Z 2025-05-07T20:01:37.8016933Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8019234Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8020211Z ^ 2025-05-07T20:01:37.8020432Z 2025-05-07T20:01:37.8020863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.8021542Z 2025-05-07T20:01:37.8023192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8025824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8026953Z ^ 2025-05-07T20:01:37.8027347Z 2025-05-07T20:01:37.8029094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8031592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8032611Z ^ 2025-05-07T20:01:37.8049156Z 2025-05-07T20:01:37.8049841Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:37.8050553Z 2025-05-07T20:01:37.8052007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:37.8054252Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:37.8055284Z ^ 2025-05-07T20:01:37.8055578Z 2025-05-07T20:01:44.7661750Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:44.7688636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7691843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7693172Z ^ 2025-05-07T20:01:44.7693565Z 2025-05-07T20:01:44.7694065Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:44.7694833Z 2025-05-07T20:01:44.7696628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7699940Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7701263Z ^ 2025-05-07T20:01:44.7701710Z 2025-05-07T20:01:44.7703497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7706293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7707584Z ^ 2025-05-07T20:01:44.7707907Z 2025-05-07T20:01:44.7708402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:44.7709144Z 2025-05-07T20:01:44.7710966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7713874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7715397Z ^ 2025-05-07T20:01:44.7715805Z 2025-05-07T20:01:44.7717565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7720476Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7721799Z ^ 2025-05-07T20:01:44.7722285Z 2025-05-07T20:01:44.7722792Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:44.7723549Z 2025-05-07T20:01:44.7725343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7728806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7730471Z ^ 2025-05-07T20:01:44.7730919Z 2025-05-07T20:01:44.7732689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7735689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7736995Z ^ 2025-05-07T20:01:44.7737289Z 2025-05-07T20:01:44.7737823Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:44.7738580Z 2025-05-07T20:01:44.7740426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7743509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7744843Z ^ 2025-05-07T20:01:44.7745248Z 2025-05-07T20:01:44.7747025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7750092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7751405Z ^ 2025-05-07T20:01:44.7751702Z 2025-05-07T20:01:44.7752190Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:01:44.7752919Z 2025-05-07T20:01:44.7754891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:44.7757574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:44.7758907Z ^ 2025-05-07T20:01:44.7759312Z 2025-05-07T20:01:54.7166677Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:01:54.7193204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7196553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7198200Z ^ 2025-05-07T20:01:54.7198512Z 2025-05-07T20:01:54.7199070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.7199839Z 2025-05-07T20:01:54.7201735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7204967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7206316Z ^ 2025-05-07T20:01:54.7206724Z 2025-05-07T20:01:54.7208504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7211439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7212764Z ^ 2025-05-07T20:01:54.7213052Z 2025-05-07T20:01:54.7213539Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.7214306Z 2025-05-07T20:01:54.7216102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7219019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7220319Z ^ 2025-05-07T20:01:54.7220733Z 2025-05-07T20:01:54.7222719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7225848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7227317Z ^ 2025-05-07T20:01:54.7227618Z 2025-05-07T20:01:54.7228151Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:54.7229422Z 2025-05-07T20:01:54.7231379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7234303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7235646Z ^ 2025-05-07T20:01:54.7236057Z 2025-05-07T20:01:54.7237822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7240719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7242023Z ^ 2025-05-07T20:01:54.7242347Z 2025-05-07T20:01:54.7242840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.7243574Z 2025-05-07T20:01:54.7245607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7248588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7250222Z ^ 2025-05-07T20:01:54.7250637Z 2025-05-07T20:01:54.7252717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7255603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7256925Z ^ 2025-05-07T20:01:54.7257208Z 2025-05-07T20:01:54.7257702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.7258461Z 2025-05-07T20:01:54.7260242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.7263172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.7264468Z ^ 2025-05-07T20:01:54.7264919Z 2025-05-07T20:01:56.3795656Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:56.3818868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3821913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3823121Z ^ 2025-05-07T20:01:56.3823412Z 2025-05-07T20:01:56.3823879Z Remark: The warnings 
can be suppressed with "-diag-suppress " 2025-05-07T20:01:56.3824558Z 2025-05-07T20:01:56.3826256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3829151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3830355Z ^ 2025-05-07T20:01:56.3830714Z 2025-05-07T20:01:56.3832346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3835129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3836315Z ^ 2025-05-07T20:01:56.3836567Z 2025-05-07T20:01:56.3837034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:56.3837692Z 2025-05-07T20:01:56.3839322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3842350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3843512Z ^ 2025-05-07T20:01:56.3843874Z 2025-05-07T20:01:56.3845580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3848266Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3849596Z ^ 2025-05-07T20:01:56.3849887Z 2025-05-07T20:01:56.3850350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:56.3851034Z 2025-05-07T20:01:56.3852941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3855654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3856821Z ^ 2025-05-07T20:01:56.3857369Z 2025-05-07T20:01:56.3859086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3861743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3862950Z ^ 2025-05-07T20:01:56.3863214Z 2025-05-07T20:01:56.3863634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:56.3864502Z 2025-05-07T20:01:56.3866204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3868896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3870124Z ^ 2025-05-07T20:01:56.3870528Z 2025-05-07T20:01:56.3872158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3874879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3876048Z ^ 2025-05-07T20:01:56.3876336Z 2025-05-07T20:01:56.3876784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:56.3877448Z 2025-05-07T20:01:56.3879149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:56.3881750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:56.3883001Z ^ 2025-05-07T20:01:56.3883363Z 2025-05-07T20:02:01.1610504Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:01.1634658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1637149Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1638281Z ^ 2025-05-07T20:02:01.1638567Z 2025-05-07T20:02:01.1638985Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.1639645Z 2025-05-07T20:02:01.1641147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1643769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1644999Z ^ 2025-05-07T20:02:01.1645366Z 2025-05-07T20:02:01.1646980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1649684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1650901Z ^ 2025-05-07T20:02:01.1651164Z 2025-05-07T20:02:01.1651998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.1652644Z 2025-05-07T20:02:01.1654347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1657314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1658472Z ^ 2025-05-07T20:02:01.1658847Z 2025-05-07T20:02:01.1660558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1663238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1664372Z ^ 2025-05-07T20:02:01.1664633Z 2025-05-07T20:02:01.1665117Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.1665799Z 2025-05-07T20:02:01.1667444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1670154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1671349Z ^ 2025-05-07T20:02:01.1671678Z 2025-05-07T20:02:01.1673355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1676349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1677360Z ^ 2025-05-07T20:02:01.1677626Z 2025-05-07T20:02:01.1678052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.1678643Z 2025-05-07T20:02:01.1680213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1682755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:01.1683930Z ^ 2025-05-07T20:02:01.1684302Z 2025-05-07T20:02:01.1685958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1688531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1689722Z ^ 2025-05-07T20:02:01.1689976Z 2025-05-07T20:02:01.1690452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.1691119Z 2025-05-07T20:02:01.1692795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.1695388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.1696662Z ^ 2025-05-07T20:02:01.1697010Z 2025-05-07T20:02:01.4988251Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:01.5009463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:02:01.5012076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5013140Z ^ 2025-05-07T20:02:01.5013405Z 2025-05-07T20:02:01.5013811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5014396Z 2025-05-07T20:02:01.5015895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5018227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5019390Z ^ 2025-05-07T20:02:01.5019717Z 2025-05-07T20:02:01.5021530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5023971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5025044Z ^ 2025-05-07T20:02:01.5025278Z 2025-05-07T20:02:01.5025669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5026453Z 2025-05-07T20:02:01.5028158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5030867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5032012Z ^ 2025-05-07T20:02:01.5032342Z 2025-05-07T20:02:01.5033866Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5036590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5037690Z ^ 2025-05-07T20:02:01.5037953Z 2025-05-07T20:02:01.5038377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5039006Z 2025-05-07T20:02:01.5040509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5042961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5044070Z ^ 2025-05-07T20:02:01.5044646Z 2025-05-07T20:02:01.5046144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5048648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5049783Z ^ 2025-05-07T20:02:01.5050033Z 2025-05-07T20:02:01.5050467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5051065Z 2025-05-07T20:02:01.5052572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5055171Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5056262Z ^ 2025-05-07T20:02:01.5056585Z 2025-05-07T20:02:01.5058051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5060522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5061630Z ^ 2025-05-07T20:02:01.5061904Z 2025-05-07T20:02:01.5062314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:01.5062923Z 2025-05-07T20:02:01.5064684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:01.5067029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:01.5068197Z ^ 2025-05-07T20:02:01.5068706Z 2025-05-07T20:02:11.3012226Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 
2025-05-07T20:02:11.3035763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3038428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3039526Z ^ 2025-05-07T20:02:11.3039774Z 2025-05-07T20:02:11.3040202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.3040860Z 2025-05-07T20:02:11.3042480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3045271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3046399Z ^ 2025-05-07T20:02:11.3046752Z 2025-05-07T20:02:11.3048222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3051172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3052295Z ^ 2025-05-07T20:02:11.3052529Z 2025-05-07T20:02:11.3052964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.3053620Z 2025-05-07T20:02:11.3055268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3057918Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3059141Z ^ 2025-05-07T20:02:11.3059502Z 2025-05-07T20:02:11.3061130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3063795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3064959Z ^ 2025-05-07T20:02:11.3065198Z 2025-05-07T20:02:11.3065634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.3066252Z 2025-05-07T20:02:11.3067880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3070462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3071573Z ^ 2025-05-07T20:02:11.3071928Z 2025-05-07T20:02:11.3073620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3076327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3077444Z ^ 2025-05-07T20:02:11.3077695Z 2025-05-07T20:02:11.3078132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.3078786Z 2025-05-07T20:02:11.3080437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3083095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3084290Z ^ 2025-05-07T20:02:11.3084658Z 2025-05-07T20:02:11.3086271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3089103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3090253Z ^ 2025-05-07T20:02:11.3090508Z 2025-05-07T20:02:11.3090944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.3091598Z 2025-05-07T20:02:11.3093302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3095967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.3097103Z ^ 2025-05-07T20:02:11.3097474Z 2025-05-07T20:02:14.7410361Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:14.7436326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7439217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7440501Z ^ 2025-05-07T20:02:14.7440781Z 2025-05-07T20:02:14.7441263Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.7442013Z 2025-05-07T20:02:14.7444021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7446929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7448373Z ^ 2025-05-07T20:02:14.7448909Z 2025-05-07T20:02:14.7450681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7453578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7454859Z ^ 2025-05-07T20:02:14.7455159Z 2025-05-07T20:02:14.7455657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.7456354Z 2025-05-07T20:02:14.7458209Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7461034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7462329Z ^ 2025-05-07T20:02:14.7462725Z 2025-05-07T20:02:14.7464501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7467207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7468656Z ^ 2025-05-07T20:02:14.7468943Z 2025-05-07T20:02:14.7469432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.7470201Z 2025-05-07T20:02:14.7471997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7475029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7476341Z ^ 2025-05-07T20:02:14.7476776Z 2025-05-07T20:02:14.7478555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7481481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7482779Z ^ 
2025-05-07T20:02:14.7483078Z 2025-05-07T20:02:14.7483559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.7484277Z 2025-05-07T20:02:14.7486080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7488989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7490329Z ^ 2025-05-07T20:02:14.7490733Z 2025-05-07T20:02:14.7494049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7496861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7498308Z ^ 2025-05-07T20:02:14.7498528Z 2025-05-07T20:02:14.7499099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:14.7499857Z 2025-05-07T20:02:14.7501567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.7504326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:14.7505590Z ^ 2025-05-07T20:02:14.7505999Z 2025-05-07T20:02:19.7670403Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:19.7696842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7699794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7701123Z ^ 2025-05-07T20:02:19.7701420Z 2025-05-07T20:02:19.7702102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.7702845Z 2025-05-07T20:02:19.7704774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7707709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7709017Z ^ 2025-05-07T20:02:19.7709418Z 2025-05-07T20:02:19.7711187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7714228Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7715516Z ^ 2025-05-07T20:02:19.7715811Z 2025-05-07T20:02:19.7716293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.7717013Z 2025-05-07T20:02:19.7718801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7721678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7723110Z ^ 2025-05-07T20:02:19.7723508Z 2025-05-07T20:02:19.7725284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7728117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7729677Z ^ 2025-05-07T20:02:19.7729959Z 2025-05-07T20:02:19.7730440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.7731149Z 2025-05-07T20:02:19.7732922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7735812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7737118Z ^ 2025-05-07T20:02:19.7737529Z 2025-05-07T20:02:19.7739285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7742173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7743438Z ^ 2025-05-07T20:02:19.7743736Z 2025-05-07T20:02:19.7744211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.7744929Z 2025-05-07T20:02:19.7746924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7749849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7751152Z ^ 2025-05-07T20:02:19.7751711Z 2025-05-07T20:02:19.7753568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7756204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7757318Z ^ 2025-05-07T20:02:19.7757603Z 2025-05-07T20:02:19.7758082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.7758769Z 2025-05-07T20:02:19.7760528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.7763353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.7764651Z ^ 
2025-05-07T20:02:19.7765069Z 2025-05-07T20:02:22.4884761Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:22.4908576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4911527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4912678Z ^ 2025-05-07T20:02:22.4912925Z 2025-05-07T20:02:22.4913345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.4914215Z 2025-05-07T20:02:22.4915854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4918528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4919605Z ^ 2025-05-07T20:02:22.4919930Z 2025-05-07T20:02:22.4921569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4924135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4925211Z ^ 2025-05-07T20:02:22.4925455Z 2025-05-07T20:02:22.4925901Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.4926500Z 2025-05-07T20:02:22.4928278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4931037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4932226Z ^ 2025-05-07T20:02:22.4932581Z 2025-05-07T20:02:22.4934148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4936679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4937720Z ^ 2025-05-07T20:02:22.4937938Z 2025-05-07T20:02:22.4938351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.4938983Z 2025-05-07T20:02:22.4940486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4942923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:02:22.4944002Z ^ 2025-05-07T20:02:22.4944309Z 2025-05-07T20:02:22.4945967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4948726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4949843Z ^ 2025-05-07T20:02:22.4950088Z 2025-05-07T20:02:22.4950538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.4951197Z 2025-05-07T20:02:22.4953054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4955824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4956971Z ^ 2025-05-07T20:02:22.4957323Z 2025-05-07T20:02:22.4958919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.4961474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4962618Z ^ 2025-05-07T20:02:22.4962865Z 2025-05-07T20:02:22.4963287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.4963891Z 2025-05-07T20:02:22.4965484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:02:22.4968011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.4969073Z ^ 2025-05-07T20:02:22.4969392Z 2025-05-07T20:02:24.1456551Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:24.1482652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1485535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1486869Z ^ 2025-05-07T20:02:24.1487158Z 2025-05-07T20:02:24.1487659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:24.1488382Z 2025-05-07T20:02:24.1490188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1493101Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1494383Z ^ 2025-05-07T20:02:24.1494797Z 2025-05-07T20:02:24.1496273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1499331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1500598Z ^ 2025-05-07T20:02:24.1500900Z 2025-05-07T20:02:24.1501380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:24.1502110Z 2025-05-07T20:02:24.1503853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1506614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1507858Z ^ 2025-05-07T20:02:24.1508253Z 2025-05-07T20:02:24.1510041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1512888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1514350Z ^ 2025-05-07T20:02:24.1514628Z 2025-05-07T20:02:24.1515113Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:24.1515665Z 2025-05-07T20:02:24.1517442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1520325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1521499Z ^ 2025-05-07T20:02:24.1522118Z 2025-05-07T20:02:24.1523887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1526865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1528132Z ^ 2025-05-07T20:02:24.1528950Z 2025-05-07T20:02:24.1529426Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:24.1530142Z 2025-05-07T20:02:24.1531920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1534717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1536021Z ^ 2025-05-07T20:02:24.1536416Z 2025-05-07T20:02:24.1538171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1540902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1542196Z ^ 2025-05-07T20:02:24.1542472Z 2025-05-07T20:02:24.1542951Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:02:24.1543677Z 2025-05-07T20:02:24.1545466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:24.1548474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:24.1549761Z ^ 2025-05-07T20:02:24.1550168Z 2025-05-07T20:02:26.9423201Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:26.9444998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9447513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9448596Z ^ 2025-05-07T20:02:26.9448842Z 2025-05-07T20:02:26.9449272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:26.9449930Z 2025-05-07T20:02:26.9451396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9453803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9455155Z ^ 2025-05-07T20:02:26.9455523Z 2025-05-07T20:02:26.9457070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9459544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9460557Z ^ 2025-05-07T20:02:26.9460871Z 2025-05-07T20:02:26.9461271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:26.9461905Z 2025-05-07T20:02:26.9463380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9465954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9467193Z ^ 2025-05-07T20:02:26.9467564Z 2025-05-07T20:02:26.9469229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9471775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9472826Z ^ 2025-05-07T20:02:26.9473102Z 2025-05-07T20:02:26.9473564Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:02:26.9474397Z 2025-05-07T20:02:26.9476351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9478647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9479701Z ^ 2025-05-07T20:02:26.9480011Z 2025-05-07T20:02:26.9481640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9484189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9485319Z ^ 2025-05-07T20:02:26.9485570Z 2025-05-07T20:02:26.9486049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:26.9486715Z 2025-05-07T20:02:26.9488386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9491331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9492554Z ^ 2025-05-07T20:02:26.9492931Z 2025-05-07T20:02:26.9494588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9497262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9498454Z ^ 2025-05-07T20:02:26.9498842Z 2025-05-07T20:02:26.9499295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:26.9500119Z 2025-05-07T20:02:26.9501812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.9504500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:26.9505723Z ^ 2025-05-07T20:02:26.9506094Z 2025-05-07T20:02:34.1805970Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:34.1828977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1831536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1832688Z ^ 2025-05-07T20:02:34.1832957Z 2025-05-07T20:02:34.1833427Z Remark: 
The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.1834135Z 2025-05-07T20:02:34.1835754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1838883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1840062Z ^ 2025-05-07T20:02:34.1840426Z 2025-05-07T20:02:34.1842063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1844613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1845609Z ^ 2025-05-07T20:02:34.1845840Z 2025-05-07T20:02:34.1846218Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.1846850Z 2025-05-07T20:02:34.1848348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1850785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1851902Z ^ 2025-05-07T20:02:34.1852234Z 2025-05-07T20:02:34.1853655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1856426Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1857535Z ^ 2025-05-07T20:02:34.1857773Z 2025-05-07T20:02:34.1858187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.1858857Z 2025-05-07T20:02:34.1860375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1863097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1864183Z ^ 2025-05-07T20:02:34.1864554Z 2025-05-07T20:02:34.1866100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1868647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1869709Z ^ 2025-05-07T20:02:34.1869974Z 2025-05-07T20:02:34.1870363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.1870968Z 2025-05-07T20:02:34.1872482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1875153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1876333Z ^ 2025-05-07T20:02:34.1876689Z 2025-05-07T20:02:34.1878158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1883997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1885181Z ^ 2025-05-07T20:02:34.1885443Z 2025-05-07T20:02:34.1885825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:34.1886433Z 2025-05-07T20:02:34.1887949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1890077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:34.1891086Z ^ 2025-05-07T20:02:34.1891466Z 2025-05-07T20:02:40.5496574Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:40.5516325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5519015Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5520463Z ^ 2025-05-07T20:02:40.5520755Z 2025-05-07T20:02:40.5521216Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.5521919Z 2025-05-07T20:02:40.5523691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5526484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5527749Z ^ 2025-05-07T20:02:40.5528115Z 2025-05-07T20:02:40.5529804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5531913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5532798Z ^ 2025-05-07T20:02:40.5533062Z 2025-05-07T20:02:40.5533409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.5533925Z 2025-05-07T20:02:40.5535142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5537084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5537964Z ^ 2025-05-07T20:02:40.5538282Z 2025-05-07T20:02:40.5539860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5542317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5543263Z ^ 2025-05-07T20:02:40.5543489Z 2025-05-07T20:02:40.5543936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.5544442Z 2025-05-07T20:02:40.5545622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5547473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5548682Z ^ 2025-05-07T20:02:40.5548953Z 2025-05-07T20:02:40.5550109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5552030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5552889Z ^ 2025-05-07T20:02:40.5553080Z 2025-05-07T20:02:40.5553418Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.5554029Z 2025-05-07T20:02:40.5555225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5557438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:40.5558292Z ^ 2025-05-07T20:02:40.5558593Z 2025-05-07T20:02:40.5559797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5561945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5562858Z ^ 2025-05-07T20:02:40.5563058Z 2025-05-07T20:02:40.5563413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.5563893Z 2025-05-07T20:02:40.5565083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.5566985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.5567873Z ^ 2025-05-07T20:02:40.5568144Z 2025-05-07T20:02:48.2432127Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:48.2456933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2459700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2460916Z ^ 2025-05-07T20:02:48.2461183Z 2025-05-07T20:02:48.2461723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2462410Z 2025-05-07T20:02:48.2464108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2466858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2468082Z ^ 2025-05-07T20:02:48.2468609Z 2025-05-07T20:02:48.2470160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2472802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2474132Z ^ 2025-05-07T20:02:48.2474418Z 2025-05-07T20:02:48.2474861Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2475511Z 2025-05-07T20:02:48.2477460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2479937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:48.2481137Z ^ 2025-05-07T20:02:48.2481469Z 2025-05-07T20:02:48.2482884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2485175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2486268Z ^ 2025-05-07T20:02:48.2486492Z 2025-05-07T20:02:48.2486878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2487552Z 2025-05-07T20:02:48.2489212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2491575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2492746Z ^ 2025-05-07T20:02:48.2493122Z 2025-05-07T20:02:48.2494804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2497564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2498788Z ^ 2025-05-07T20:02:48.2499213Z 2025-05-07T20:02:48.2499679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2500286Z 2025-05-07T20:02:48.2501864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:02:48.2504622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2505761Z ^ 2025-05-07T20:02:48.2506153Z 2025-05-07T20:02:48.2507758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2510070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2511040Z ^ 2025-05-07T20:02:48.2511303Z 2025-05-07T20:02:48.2511700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2512288Z 2025-05-07T20:02:48.2513753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2516039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2516964Z ^ 2025-05-07T20:02:48.2517251Z 2025-05-07T20:02:50.8202865Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:50.8227520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8230535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8231765Z ^ 2025-05-07T20:02:50.8232071Z 2025-05-07T20:02:50.8232557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:50.8233258Z 2025-05-07T20:02:50.8235053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8237752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8238800Z ^ 2025-05-07T20:02:50.8239276Z 2025-05-07T20:02:50.8240806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8243763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8245000Z ^ 2025-05-07T20:02:50.8245255Z 2025-05-07T20:02:50.8245710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:50.8246425Z 
2025-05-07T20:02:50.8248423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8250988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8252151Z ^ 2025-05-07T20:02:50.8252510Z 2025-05-07T20:02:50.8254087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8256490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8257588Z ^ 2025-05-07T20:02:50.8257837Z 2025-05-07T20:02:50.8258224Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:50.8258851Z 2025-05-07T20:02:50.8260459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8263034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8264220Z ^ 2025-05-07T20:02:50.8264576Z 2025-05-07T20:02:50.8266254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8269100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:50.8270489Z ^ 2025-05-07T20:02:50.8270751Z 2025-05-07T20:02:50.8271226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:50.8271956Z 2025-05-07T20:02:50.8273645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8276484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8277725Z ^ 2025-05-07T20:02:50.8278128Z 2025-05-07T20:02:50.8279787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8282333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8283404Z ^ 2025-05-07T20:02:50.8283684Z 2025-05-07T20:02:50.8284131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:50.8284772Z 2025-05-07T20:02:50.8286452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.8289326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:50.8290522Z ^ 2025-05-07T20:02:50.8290894Z 2025-05-07T20:02:54.2983499Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:02:54.3006024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3008740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3009959Z ^ 2025-05-07T20:02:54.3010271Z 2025-05-07T20:02:54.3010744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:54.3011442Z 2025-05-07T20:02:54.3013239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3016039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3017403Z ^ 2025-05-07T20:02:54.3017770Z 2025-05-07T20:02:54.3019931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3022675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3023949Z ^ 2025-05-07T20:02:54.3024209Z 2025-05-07T20:02:54.3024717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:54.3025419Z 2025-05-07T20:02:54.3027114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3030093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3031178Z ^ 2025-05-07T20:02:54.3031566Z 2025-05-07T20:02:54.3033174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3035835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3036947Z ^ 2025-05-07T20:02:54.3037197Z 2025-05-07T20:02:54.3037796Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:54.3038486Z 2025-05-07T20:02:54.3040329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3043220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3044434Z ^ 2025-05-07T20:02:54.3044810Z 2025-05-07T20:02:54.3046514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3049139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3050344Z ^ 2025-05-07T20:02:54.3050611Z 2025-05-07T20:02:54.3051062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:54.3051755Z 2025-05-07T20:02:54.3053411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3056117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3057403Z ^ 2025-05-07T20:02:54.3057769Z 2025-05-07T20:02:54.3059501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3062065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3063199Z ^ 2025-05-07T20:02:54.3063460Z 2025-05-07T20:02:54.3064211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:54.3064850Z 2025-05-07T20:02:54.3066506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:54.3069159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:54.3070497Z ^ 2025-05-07T20:02:54.3070861Z 2025-05-07T20:02:57.5755178Z 
[379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:02:57.5779899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5782678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5783899Z ^ 2025-05-07T20:02:57.5784164Z 2025-05-07T20:02:57.5784648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.5785506Z 2025-05-07T20:02:57.5787487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5790376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5791732Z ^ 2025-05-07T20:02:57.5792148Z 2025-05-07T20:02:57.5794137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5797021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5798225Z ^ 2025-05-07T20:02:57.5798664Z 2025-05-07T20:02:57.5799125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.5799816Z 2025-05-07T20:02:57.5801636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5804388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5805646Z ^ 2025-05-07T20:02:57.5806023Z 2025-05-07T20:02:57.5807769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5810502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5811727Z ^ 2025-05-07T20:02:57.5812102Z 2025-05-07T20:02:57.5812553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.5813199Z 2025-05-07T20:02:57.5814781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5817491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5818709Z ^ 
2025-05-07T20:02:57.5819117Z 2025-05-07T20:02:57.5820790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5823562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5824767Z ^ 2025-05-07T20:02:57.5825053Z 2025-05-07T20:02:57.5825519Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.5826208Z 2025-05-07T20:02:57.5827967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5830977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5832223Z ^ 2025-05-07T20:02:57.5832595Z 2025-05-07T20:02:57.5834593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.5837326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5838537Z ^ 2025-05-07T20:02:57.5838884Z 2025-05-07T20:02:57.5839348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.5840056Z 2025-05-07T20:02:57.5841850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:57.5844625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.5845850Z ^ 2025-05-07T20:02:57.5846251Z 2025-05-07T20:02:58.4321944Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:02:58.4347029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4349819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4351309Z ^ 2025-05-07T20:02:58.4351586Z 2025-05-07T20:02:58.4352048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4352763Z 2025-05-07T20:02:58.4354815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4357926Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4359180Z ^ 2025-05-07T20:02:58.4359566Z 2025-05-07T20:02:58.4361344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4364151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4365351Z ^ 2025-05-07T20:02:58.4365612Z 2025-05-07T20:02:58.4366106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4366807Z 2025-05-07T20:02:58.4368580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4371424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4372700Z ^ 2025-05-07T20:02:58.4373247Z 2025-05-07T20:02:58.4374956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4377836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4379068Z ^ 2025-05-07T20:02:58.4379328Z 2025-05-07T20:02:58.4379785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4380461Z 2025-05-07T20:02:58.4382347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4385318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4386613Z ^ 2025-05-07T20:02:58.4387004Z 2025-05-07T20:02:58.4388757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4391544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4392817Z ^ 2025-05-07T20:02:58.4393089Z 2025-05-07T20:02:58.4393696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.4394482Z 2025-05-07T20:02:58.4396194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4399144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4400278Z ^ 2025-05-07T20:02:58.4400632Z 2025-05-07T20:02:58.4402233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4406172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4407367Z ^ 2025-05-07T20:02:58.4407652Z 2025-05-07T20:02:58.4408111Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:02:58.4408790Z 2025-05-07T20:02:58.4410544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.4413492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.4414759Z ^ 2025-05-07T20:02:58.4415140Z 2025-05-07T20:03:00.3208141Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:00.3232530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3235463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3236740Z ^ 2025-05-07T20:03:00.3237047Z 2025-05-07T20:03:00.3237589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.3238255Z 2025-05-07T20:03:00.3239881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3242501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3243705Z ^ 2025-05-07T20:03:00.3244078Z 2025-05-07T20:03:00.3245699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3248342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3249478Z ^ 2025-05-07T20:03:00.3249742Z 2025-05-07T20:03:00.3250180Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.3250844Z 2025-05-07T20:03:00.3252418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3254983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3256254Z ^ 2025-05-07T20:03:00.3256609Z 2025-05-07T20:03:00.3258268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3260804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3261942Z ^ 2025-05-07T20:03:00.3262194Z 2025-05-07T20:03:00.3262646Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:00.3263292Z 2025-05-07T20:03:00.3264925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3267594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3268793Z ^ 2025-05-07T20:03:00.3269154Z 2025-05-07T20:03:00.3270760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3273351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3274711Z ^ 2025-05-07T20:03:00.3274952Z 2025-05-07T20:03:00.3275387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.3276047Z 2025-05-07T20:03:00.3277887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3280535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3281771Z ^ 2025-05-07T20:03:00.3282191Z 2025-05-07T20:03:00.3283844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3286481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3287608Z ^ 2025-05-07T20:03:00.3287866Z 2025-05-07T20:03:00.3288318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.3288968Z 2025-05-07T20:03:00.3290565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.3293168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.3294046Z ^ 2025-05-07T20:03:00.3294360Z 2025-05-07T20:03:03.9810872Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:03.9836523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9839291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9840632Z ^ 2025-05-07T20:03:03.9840909Z 2025-05-07T20:03:03.9841420Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.9842077Z 2025-05-07T20:03:03.9843801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9846732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9847980Z ^ 2025-05-07T20:03:03.9848352Z 2025-05-07T20:03:03.9849991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9852655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9853841Z ^ 2025-05-07T20:03:03.9854102Z 2025-05-07T20:03:03.9854559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.9855437Z 2025-05-07T20:03:03.9857093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9859695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9860903Z ^ 2025-05-07T20:03:03.9861264Z 2025-05-07T20:03:03.9862833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9865490Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9866640Z ^ 2025-05-07T20:03:03.9866902Z 2025-05-07T20:03:03.9867370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.9868136Z 2025-05-07T20:03:03.9869831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9872681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9874112Z ^ 2025-05-07T20:03:03.9874511Z 2025-05-07T20:03:03.9876247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9879429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9880784Z ^ 2025-05-07T20:03:03.9881077Z 2025-05-07T20:03:03.9881537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.9882335Z 2025-05-07T20:03:03.9884139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9886602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9887785Z ^ 2025-05-07T20:03:03.9888150Z 2025-05-07T20:03:03.9889802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9892456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9893666Z ^ 2025-05-07T20:03:03.9893912Z 2025-05-07T20:03:03.9894329Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.9894956Z 2025-05-07T20:03:03.9896561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.9899201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.9900336Z ^ 2025-05-07T20:03:03.9900873Z 2025-05-07T20:03:04.4517108Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:04.4530548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4532016Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4532673Z ^ 2025-05-07T20:03:04.4532851Z 2025-05-07T20:03:04.4533105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:04.4533474Z 2025-05-07T20:03:04.4534390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4535825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4536506Z ^ 2025-05-07T20:03:04.4536716Z 2025-05-07T20:03:04.4537617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4539142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4539811Z ^ 2025-05-07T20:03:04.4550444Z 2025-05-07T20:03:04.4550837Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:04.4551219Z 2025-05-07T20:03:04.4552205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4553634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4554469Z ^ 2025-05-07T20:03:04.4554676Z 2025-05-07T20:03:04.4555587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4556974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4557643Z ^ 2025-05-07T20:03:04.4557793Z 2025-05-07T20:03:04.4558074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:04.4558433Z 2025-05-07T20:03:04.4559293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4561037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4561733Z ^ 2025-05-07T20:03:04.4561943Z 2025-05-07T20:03:04.4562810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4564553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4565200Z ^ 2025-05-07T20:03:04.4565370Z 2025-05-07T20:03:04.4565623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:04.4565992Z 2025-05-07T20:03:04.4566909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4568348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:03:04.4569028Z ^ 2025-05-07T20:03:04.4569234Z 2025-05-07T20:03:04.4570142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4571558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4572222Z ^ 2025-05-07T20:03:04.4572367Z 2025-05-07T20:03:04.4572624Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:04.4573015Z 2025-05-07T20:03:04.4573905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:04.4575408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:04.4576065Z ^ 2025-05-07T20:03:04.4576296Z 2025-05-07T20:03:05.6214343Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:05.6238961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6241659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6242955Z ^ 2025-05-07T20:03:05.6243196Z 2025-05-07T20:03:05.6243665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:05.6244365Z 2025-05-07T20:03:05.6245734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6248464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6249535Z ^ 2025-05-07T20:03:05.6249868Z 2025-05-07T20:03:05.6251382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6253711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6254900Z ^ 2025-05-07T20:03:05.6255144Z 2025-05-07T20:03:05.6255617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:05.6256250Z 2025-05-07T20:03:05.6257767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6260594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:03:05.6261676Z ^ 2025-05-07T20:03:05.6261992Z 2025-05-07T20:03:05.6263369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6265724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6266761Z ^ 2025-05-07T20:03:05.6267054Z 2025-05-07T20:03:05.6267483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:05.6268370Z 2025-05-07T20:03:05.6269824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6272415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6273720Z ^ 2025-05-07T20:03:05.6274227Z 2025-05-07T20:03:05.6275833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6278411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6279677Z ^ 2025-05-07T20:03:05.6279953Z 2025-05-07T20:03:05.6280369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:05.6281176Z 2025-05-07T20:03:05.6282868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:03:05.6285550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6286880Z ^ 2025-05-07T20:03:05.6287274Z 2025-05-07T20:03:05.6288826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6291496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6292777Z ^ 2025-05-07T20:03:05.6293035Z 2025-05-07T20:03:05.6293435Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:05.6294002Z 2025-05-07T20:03:05.6295261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:05.6297561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:05.6298602Z ^ 2025-05-07T20:03:05.6298928Z 2025-05-07T20:03:07.4942706Z [385/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:07.4960491Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:08.3278170Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:08.3301155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3303794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3304949Z ^ 2025-05-07T20:03:08.3305208Z 2025-05-07T20:03:08.3305764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.3306379Z 2025-05-07T20:03:08.3308115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3310733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3311913Z ^ 2025-05-07T20:03:08.3312278Z 2025-05-07T20:03:08.3314163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3316725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3317884Z ^ 2025-05-07T20:03:08.3318149Z 2025-05-07T20:03:08.3318602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.3319226Z 2025-05-07T20:03:08.3320784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3323439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3324699Z ^ 2025-05-07T20:03:08.3325079Z 2025-05-07T20:03:08.3326723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3329759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3330814Z ^ 2025-05-07T20:03:08.3331018Z 2025-05-07T20:03:08.3331405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.3331921Z 2025-05-07T20:03:08.3333364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3335950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3337193Z ^ 2025-05-07T20:03:08.3337580Z 2025-05-07T20:03:08.3339232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3342119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3343222Z ^ 2025-05-07T20:03:08.3343460Z 2025-05-07T20:03:08.3343898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.3344563Z 2025-05-07T20:03:08.3346333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3348991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3350304Z ^ 2025-05-07T20:03:08.3350675Z 2025-05-07T20:03:08.3352416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3355241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3356441Z ^ 2025-05-07T20:03:08.3356690Z 2025-05-07T20:03:08.3357170Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:08.3357853Z 2025-05-07T20:03:08.3359506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:08.3362317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:08.3363501Z ^ 2025-05-07T20:03:08.3363873Z 2025-05-07T20:03:09.4316448Z 
[387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:09.4337582Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:12.9051208Z [388/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:12.9072033Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:15.5070815Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:15.5093359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5096018Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5097322Z ^ 2025-05-07T20:03:15.5097583Z 2025-05-07T20:03:15.5098031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5098652Z 2025-05-07T20:03:15.5100221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5102709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5104296Z ^ 2025-05-07T20:03:15.5104634Z 2025-05-07T20:03:15.5106232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5108804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5110011Z ^ 2025-05-07T20:03:15.5110283Z 2025-05-07T20:03:15.5110699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5111440Z 2025-05-07T20:03:15.5113076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5115826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5116957Z ^ 2025-05-07T20:03:15.5117320Z 2025-05-07T20:03:15.5118748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5121168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5122284Z ^ 2025-05-07T20:03:15.5122558Z 2025-05-07T20:03:15.5122973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5123556Z 2025-05-07T20:03:15.5125326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5127671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5129099Z ^ 2025-05-07T20:03:15.5129703Z 2025-05-07T20:03:15.5131422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5134106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5135256Z ^ 2025-05-07T20:03:15.5135498Z 2025-05-07T20:03:15.5135872Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5136519Z 2025-05-07T20:03:15.5138134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5140700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:15.5141799Z ^ 2025-05-07T20:03:15.5142171Z 2025-05-07T20:03:15.5143702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5146190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5147343Z ^ 2025-05-07T20:03:15.5147808Z 2025-05-07T20:03:15.5148253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:15.5148839Z 2025-05-07T20:03:15.5150356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:15.5152894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:15.5154244Z ^ 2025-05-07T20:03:15.5154615Z 2025-05-07T20:03:25.7198254Z [390/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:25.7217937Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:26.4641066Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:26.4661846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4664434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4665596Z ^ 2025-05-07T20:03:26.4665899Z 2025-05-07T20:03:26.4666582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.4667230Z 2025-05-07T20:03:26.4668653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4671335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4672368Z ^ 2025-05-07T20:03:26.4672715Z 2025-05-07T20:03:26.4674311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4676753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4677800Z ^ 2025-05-07T20:03:26.4678061Z 2025-05-07T20:03:26.4678477Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.4679119Z 2025-05-07T20:03:26.4680782Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4683153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4684277Z ^ 2025-05-07T20:03:26.4684641Z 2025-05-07T20:03:26.4686194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4689028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4690224Z ^ 2025-05-07T20:03:26.4690498Z 2025-05-07T20:03:26.4690951Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.4691668Z 2025-05-07T20:03:26.4693404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4695995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4697037Z ^ 2025-05-07T20:03:26.4697378Z 2025-05-07T20:03:26.4698858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4701136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4702167Z ^ 
2025-05-07T20:03:26.4702440Z 2025-05-07T20:03:26.4702818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.4703390Z 2025-05-07T20:03:26.4704758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4706886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4708060Z ^ 2025-05-07T20:03:26.4708377Z 2025-05-07T20:03:26.4709761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4712218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4713246Z ^ 2025-05-07T20:03:26.4713473Z 2025-05-07T20:03:26.4714035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.4714647Z 2025-05-07T20:03:26.4716066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.4718337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.4719365Z ^ 2025-05-07T20:03:26.4719715Z 2025-05-07T20:03:26.5115034Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:26.5139723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5142398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5143720Z ^ 2025-05-07T20:03:26.5143955Z 2025-05-07T20:03:26.5144338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.5145068Z 2025-05-07T20:03:26.5146469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5148995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5150140Z ^ 2025-05-07T20:03:26.5150524Z 2025-05-07T20:03:26.5152133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5154909Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5156001Z ^ 2025-05-07T20:03:26.5156254Z 2025-05-07T20:03:26.5156689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.5157347Z 2025-05-07T20:03:26.5158956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5161204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5163750Z ^ 2025-05-07T20:03:26.5164100Z 2025-05-07T20:03:26.5165765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5168440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5169622Z ^ 2025-05-07T20:03:26.5169868Z 2025-05-07T20:03:26.5170305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.5170985Z 2025-05-07T20:03:26.5172634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5174740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5175639Z ^ 2025-05-07T20:03:26.5175939Z 2025-05-07T20:03:26.5177178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5179295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5180345Z ^ 2025-05-07T20:03:26.5180570Z 2025-05-07T20:03:26.5180958Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.5181539Z 2025-05-07T20:03:26.5183222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5185515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5186619Z ^ 2025-05-07T20:03:26.5186928Z 2025-05-07T20:03:26.5188557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5191053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5192144Z ^ 2025-05-07T20:03:26.5192378Z 2025-05-07T20:03:26.5192812Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.5193447Z 2025-05-07T20:03:26.5195267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.5197616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.5198633Z ^ 
2025-05-07T20:03:26.5199009Z 2025-05-07T20:03:27.4438399Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:27.4462096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4464846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4466018Z ^ 2025-05-07T20:03:27.4466272Z 2025-05-07T20:03:27.4466739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.4467404Z 2025-05-07T20:03:27.4469099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4471810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4472992Z ^ 2025-05-07T20:03:27.4473428Z 2025-05-07T20:03:27.4475215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4477902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4479066Z ^ 2025-05-07T20:03:27.4479308Z 2025-05-07T20:03:27.4479753Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.4480542Z 2025-05-07T20:03:27.4482256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4484972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4486132Z ^ 2025-05-07T20:03:27.4486504Z 2025-05-07T20:03:27.4488135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4490683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4491803Z ^ 2025-05-07T20:03:27.4492044Z 2025-05-07T20:03:27.4492502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.4493100Z 2025-05-07T20:03:27.4494463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4497164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:27.4498355Z ^ 2025-05-07T20:03:27.4498716Z 2025-05-07T20:03:27.4500380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4503186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4504363Z ^ 2025-05-07T20:03:27.4504611Z 2025-05-07T20:03:27.4505054Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.4505835Z 2025-05-07T20:03:27.4507566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4510238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4511398Z ^ 2025-05-07T20:03:27.4511755Z 2025-05-07T20:03:27.4513411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.4515889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4517079Z ^ 2025-05-07T20:03:27.4517333Z 2025-05-07T20:03:27.4517792Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.4518447Z 2025-05-07T20:03:27.4520138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:27.4522837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.4524030Z ^ 2025-05-07T20:03:27.4524517Z 2025-05-07T20:03:31.0713730Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:31.0736322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0738764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0739780Z ^ 2025-05-07T20:03:31.0740012Z 2025-05-07T20:03:31.0740456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.0741128Z 2025-05-07T20:03:31.0742733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:31.0745392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0746522Z ^ 2025-05-07T20:03:31.0746843Z 2025-05-07T20:03:31.0748383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0751139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0752096Z ^ 2025-05-07T20:03:31.0752343Z 2025-05-07T20:03:31.0752764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.0753399Z 2025-05-07T20:03:31.0755163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0757755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0758883Z ^ 2025-05-07T20:03:31.0759196Z 2025-05-07T20:03:31.0760681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0763086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0764187Z ^ 2025-05-07T20:03:31.0764411Z 2025-05-07T20:03:31.0764836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.0765480Z 2025-05-07T20:03:31.0767020Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0769812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0770900Z ^ 2025-05-07T20:03:31.0771244Z 2025-05-07T20:03:31.0772698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0775427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0776557Z ^ 2025-05-07T20:03:31.0776817Z 2025-05-07T20:03:31.0777254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.0777913Z 2025-05-07T20:03:31.0779550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0782062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0783152Z ^ 2025-05-07T20:03:31.0783494Z 2025-05-07T20:03:31.0784997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0787335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0788424Z ^ 
2025-05-07T20:03:31.0788658Z 2025-05-07T20:03:31.0789073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.0789701Z 2025-05-07T20:03:31.0791307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.0794107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.0795193Z ^ 2025-05-07T20:03:31.0795544Z 2025-05-07T20:03:31.5877167Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:31.5899927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5902568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5903740Z ^ 2025-05-07T20:03:31.5904032Z 2025-05-07T20:03:31.5904480Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.5905112Z 2025-05-07T20:03:31.5906787Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5909236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5910609Z ^ 2025-05-07T20:03:31.5910953Z 2025-05-07T20:03:31.5912479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5915093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5916205Z ^ 2025-05-07T20:03:31.5916458Z 2025-05-07T20:03:31.5916917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.5917538Z 2025-05-07T20:03:31.5919133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5921964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5923100Z ^ 2025-05-07T20:03:31.5923463Z 2025-05-07T20:03:31.5924955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5927371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5928779Z ^ 
2025-05-07T20:03:31.5929029Z 2025-05-07T20:03:31.5929406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.5929959Z 2025-05-07T20:03:31.5931726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5934212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5935462Z ^ 2025-05-07T20:03:31.5935825Z 2025-05-07T20:03:31.5937518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5940276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5941620Z ^ 2025-05-07T20:03:31.5941848Z 2025-05-07T20:03:31.5942318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.5942962Z 2025-05-07T20:03:31.5944498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5947011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5948114Z ^ 2025-05-07T20:03:31.5948474Z 2025-05-07T20:03:31.5950107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:31.5952623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5954093Z ^ 2025-05-07T20:03:31.5954389Z 2025-05-07T20:03:31.5954830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:31.5955475Z 2025-05-07T20:03:31.5957142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:31.5959741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:31.5960949Z ^ 2025-05-07T20:03:31.5961317Z 2025-05-07T20:03:32.7208243Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:32.7229832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7232414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:32.7233581Z ^ 2025-05-07T20:03:32.7233935Z 2025-05-07T20:03:32.7234392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:32.7235338Z 2025-05-07T20:03:32.7236841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7239345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7240448Z ^ 2025-05-07T20:03:32.7240824Z 2025-05-07T20:03:32.7242306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7244642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7245766Z ^ 2025-05-07T20:03:32.7246057Z 2025-05-07T20:03:32.7246498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:32.7247120Z 2025-05-07T20:03:32.7248739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7251148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7252239Z ^ 2025-05-07T20:03:32.7252567Z 2025-05-07T20:03:32.7254083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:32.7256800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7257811Z ^ 2025-05-07T20:03:32.7258022Z 2025-05-07T20:03:32.7258434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:32.7259204Z 2025-05-07T20:03:32.7260687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7263061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7264084Z ^ 2025-05-07T20:03:32.7264436Z 2025-05-07T20:03:32.7266066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7268358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7269449Z ^ 2025-05-07T20:03:32.7269714Z 2025-05-07T20:03:32.7270110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:32.7270677Z 2025-05-07T20:03:32.7272171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7274787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7275916Z ^ 2025-05-07T20:03:32.7276444Z 2025-05-07T20:03:32.7277982Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7280424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7281516Z ^ 2025-05-07T20:03:32.7281779Z 2025-05-07T20:03:32.7282207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:32.7282870Z 2025-05-07T20:03:32.7284461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:32.7287024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:32.7288188Z ^ 2025-05-07T20:03:32.7288553Z 2025-05-07T20:03:33.5965954Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:33.5987510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:03:33.5990436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.5991582Z ^ 2025-05-07T20:03:33.5991826Z 2025-05-07T20:03:33.5992259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.5992901Z 2025-05-07T20:03:33.5994660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.5997282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.5998403Z ^ 2025-05-07T20:03:33.5998772Z 2025-05-07T20:03:33.6000308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6002906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6004013Z ^ 2025-05-07T20:03:33.6004249Z 2025-05-07T20:03:33.6004704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.6005327Z 2025-05-07T20:03:33.6006925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6009493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6010783Z ^ 2025-05-07T20:03:33.6011131Z 2025-05-07T20:03:33.6012681Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6015387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6016504Z ^ 2025-05-07T20:03:33.6016743Z 2025-05-07T20:03:33.6017162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.6017808Z 2025-05-07T20:03:33.6019383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6021933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6023091Z ^ 2025-05-07T20:03:33.6023411Z 2025-05-07T20:03:33.6025048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6027572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6028906Z ^ 2025-05-07T20:03:33.6029134Z 2025-05-07T20:03:33.6029576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.6030211Z 2025-05-07T20:03:33.6031773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6034446Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6035441Z ^ 2025-05-07T20:03:33.6035729Z 2025-05-07T20:03:33.6037268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6039864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6040959Z ^ 2025-05-07T20:03:33.6041226Z 2025-05-07T20:03:33.6041673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.6042349Z 2025-05-07T20:03:33.6044007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.6046651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.6047820Z ^ 2025-05-07T20:03:33.6048190Z 2025-05-07T20:03:34.0126531Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 
2025-05-07T20:03:34.0148399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0151068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0152202Z ^ 2025-05-07T20:03:34.0152450Z 2025-05-07T20:03:34.0152881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0153545Z 2025-05-07T20:03:34.0155306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0157919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0159077Z ^ 2025-05-07T20:03:34.0159442Z 2025-05-07T20:03:34.0161051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0163570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0164739Z ^ 2025-05-07T20:03:34.0164967Z 2025-05-07T20:03:34.0165405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0166053Z 2025-05-07T20:03:34.0168026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0170634Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0171861Z ^ 2025-05-07T20:03:34.0172200Z 2025-05-07T20:03:34.0173885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0176434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0177542Z ^ 2025-05-07T20:03:34.0177780Z 2025-05-07T20:03:34.0178214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0178849Z 2025-05-07T20:03:34.0180431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0182955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0184090Z ^ 2025-05-07T20:03:34.0184434Z 2025-05-07T20:03:34.0186010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0188536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0189667Z ^ 2025-05-07T20:03:34.0189914Z 2025-05-07T20:03:34.0190350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0190826Z 2025-05-07T20:03:34.0192354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0195151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0196262Z ^ 2025-05-07T20:03:34.0196632Z 2025-05-07T20:03:34.0198280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0200907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0202054Z ^ 2025-05-07T20:03:34.0202308Z 2025-05-07T20:03:34.0202754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.0203414Z 2025-05-07T20:03:34.0205099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.0207715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.0208880Z ^ 2025-05-07T20:03:34.0209239Z 2025-05-07T20:03:39.3935140Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:39.3956457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3958910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3959997Z ^ 2025-05-07T20:03:39.3960254Z 2025-05-07T20:03:39.3960698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.3961355Z 2025-05-07T20:03:39.3962750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3965243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3966365Z ^ 2025-05-07T20:03:39.3966676Z 2025-05-07T20:03:39.3968189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3970706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3971752Z ^ 2025-05-07T20:03:39.3971994Z 2025-05-07T20:03:39.3972407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.3973010Z 2025-05-07T20:03:39.3974767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3977065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3978123Z ^ 2025-05-07T20:03:39.3978448Z 2025-05-07T20:03:39.3979978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3982320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3983326Z ^ 2025-05-07T20:03:39.3983559Z 2025-05-07T20:03:39.3983955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.3984558Z 2025-05-07T20:03:39.3986160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3988740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3989882Z ^ 2025-05-07T20:03:39.3990224Z 2025-05-07T20:03:39.3991830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.3994642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.3995708Z ^ 2025-05-07T20:03:39.3995957Z 2025-05-07T20:03:39.3996388Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:39.3997016Z 2025-05-07T20:03:39.3998635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.4001161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.4002313Z ^ 2025-05-07T20:03:39.4002652Z 2025-05-07T20:03:39.4004153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.4006513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.4007630Z ^ 2025-05-07T20:03:39.4007852Z 2025-05-07T20:03:39.4008253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.4008789Z 2025-05-07T20:03:39.4010121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.4012743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.4013761Z ^ 2025-05-07T20:03:39.4014059Z 2025-05-07T20:03:40.2329091Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:40.2350106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2352562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2353569Z ^ 2025-05-07T20:03:40.2353797Z 2025-05-07T20:03:40.2354315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:40.2354872Z 2025-05-07T20:03:40.2356333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2358759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2359789Z ^ 2025-05-07T20:03:40.2360113Z 2025-05-07T20:03:40.2361891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2364314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2365480Z ^ 2025-05-07T20:03:40.2365700Z 2025-05-07T20:03:40.2366245Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T20:03:40.2366854Z 2025-05-07T20:03:40.2368311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2370629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2371681Z ^ 2025-05-07T20:03:40.2372006Z 2025-05-07T20:03:40.2373468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2375830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2376868Z ^ 2025-05-07T20:03:40.2377109Z 2025-05-07T20:03:40.2377513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:40.2378121Z 2025-05-07T20:03:40.2379615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2382313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2383332Z ^ 2025-05-07T20:03:40.2383640Z 2025-05-07T20:03:40.2385047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2387421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2388480Z ^ 2025-05-07T20:03:40.2388707Z 2025-05-07T20:03:40.2389096Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:40.2389693Z 2025-05-07T20:03:40.2391097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2393422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2394646Z ^ 2025-05-07T20:03:40.2394976Z 2025-05-07T20:03:40.2396377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2398662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2399698Z ^ 2025-05-07T20:03:40.2399953Z 2025-05-07T20:03:40.2400364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:40.2401169Z 2025-05-07T20:03:40.2402696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:40.2405122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:40.2406279Z ^ 2025-05-07T20:03:40.2406601Z 2025-05-07T20:03:46.3333056Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:46.3357056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3359701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3360835Z ^ 2025-05-07T20:03:46.3361079Z 2025-05-07T20:03:46.3361532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3362174Z 2025-05-07T20:03:46.3364123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3366711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3367819Z ^ 2025-05-07T20:03:46.3368281Z 2025-05-07T20:03:46.3369954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3372479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3373655Z ^ 2025-05-07T20:03:46.3373916Z 2025-05-07T20:03:46.3374357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3375015Z 2025-05-07T20:03:46.3376725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3379436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3380645Z ^ 2025-05-07T20:03:46.3381003Z 2025-05-07T20:03:46.3382694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3385325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3386481Z ^ 2025-05-07T20:03:46.3386718Z 2025-05-07T20:03:46.3387155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3387967Z 2025-05-07T20:03:46.3389583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3392198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3393412Z ^ 2025-05-07T20:03:46.3393796Z 2025-05-07T20:03:46.3395601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3398235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3399391Z ^ 2025-05-07T20:03:46.3399658Z 2025-05-07T20:03:46.3400110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3400774Z 2025-05-07T20:03:46.3402481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3405147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3406342Z ^ 2025-05-07T20:03:46.3406689Z 2025-05-07T20:03:46.3408363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3411199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3412333Z ^ 2025-05-07T20:03:46.3412589Z 2025-05-07T20:03:46.3413036Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:46.3413798Z 2025-05-07T20:03:46.3415549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:46.3418256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:46.3419432Z ^ 2025-05-07T20:03:46.3419807Z 2025-05-07T20:03:48.9491250Z 
[402/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:03:48.9510654Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:52.0762914Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:52.0786874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0789772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0790919Z ^ 2025-05-07T20:03:52.0791201Z 2025-05-07T20:03:52.0791637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.0792293Z 2025-05-07T20:03:52.0794130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0796694Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0797708Z ^ 2025-05-07T20:03:52.0797984Z 2025-05-07T20:03:52.0799449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0801991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0802978Z ^ 2025-05-07T20:03:52.0803176Z 2025-05-07T20:03:52.0803569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.0804117Z 2025-05-07T20:03:52.0805593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0808052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0809387Z ^ 2025-05-07T20:03:52.0809714Z 2025-05-07T20:03:52.0811207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0813748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0814782Z ^ 2025-05-07T20:03:52.0815030Z 2025-05-07T20:03:52.0815428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.0816029Z 2025-05-07T20:03:52.0817528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0819939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0821008Z ^ 2025-05-07T20:03:52.0821328Z 2025-05-07T20:03:52.0822850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0825272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0826372Z ^ 2025-05-07T20:03:52.0826603Z 2025-05-07T20:03:52.0826965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.0827605Z 2025-05-07T20:03:52.0829467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0832419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0833629Z ^ 2025-05-07T20:03:52.0834125Z 2025-05-07T20:03:52.0835698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0838319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0839481Z ^ 2025-05-07T20:03:52.0839740Z 2025-05-07T20:03:52.0840178Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:03:52.0840854Z 2025-05-07T20:03:52.0842561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.0845262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.0846459Z ^ 2025-05-07T20:03:52.0846811Z 2025-05-07T20:03:52.1268713Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:52.1290143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1292633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1293717Z ^ 2025-05-07T20:03:52.1293950Z 2025-05-07T20:03:52.1294362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1294940Z 2025-05-07T20:03:52.1296451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:03:52.1298908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1299992Z ^ 2025-05-07T20:03:52.1300323Z 2025-05-07T20:03:52.1301838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1304321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1305344Z ^ 2025-05-07T20:03:52.1305568Z 2025-05-07T20:03:52.1305973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1306513Z 2025-05-07T20:03:52.1308182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1310638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1311857Z ^ 2025-05-07T20:03:52.1312213Z 2025-05-07T20:03:52.1313722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1316199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1317224Z ^ 2025-05-07T20:03:52.1317463Z 2025-05-07T20:03:52.1317842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1318438Z 2025-05-07T20:03:52.1319931Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1322426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1323473Z ^ 2025-05-07T20:03:52.1323790Z 2025-05-07T20:03:52.1325271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1327617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1329118Z ^ 2025-05-07T20:03:52.1329340Z 2025-05-07T20:03:52.1329743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1330356Z 2025-05-07T20:03:52.1331842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1334297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1335326Z ^ 2025-05-07T20:03:52.1335662Z 2025-05-07T20:03:52.1337153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1339585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1340642Z ^ 
2025-05-07T20:03:52.1340879Z 2025-05-07T20:03:52.1341280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:52.1341878Z 2025-05-07T20:03:52.1343377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:52.1345813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:52.1346828Z ^ 2025-05-07T20:03:52.1347171Z 2025-05-07T20:03:54.5264026Z [405/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:03:54.5283019Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:55.1520494Z [406/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:03:55.1538350Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.1841576Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:56.1864527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1867279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1868437Z ^ 2025-05-07T20:03:56.1868649Z 2025-05-07T20:03:56.1869057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.1869649Z 2025-05-07T20:03:56.1871209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1874322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1875489Z ^ 2025-05-07T20:03:56.1875828Z 2025-05-07T20:03:56.1877420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1880266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1881458Z ^ 2025-05-07T20:03:56.1881707Z 2025-05-07T20:03:56.1882171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.1882827Z 2025-05-07T20:03:56.1884450Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1886943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1888115Z ^ 2025-05-07T20:03:56.1888481Z 2025-05-07T20:03:56.1890059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1892631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1893717Z ^ 2025-05-07T20:03:56.1893976Z 2025-05-07T20:03:56.1894413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.1895184Z 2025-05-07T20:03:56.1896706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1899398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1900601Z ^ 2025-05-07T20:03:56.1900981Z 2025-05-07T20:03:56.1902634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1905314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1906513Z ^ 
2025-05-07T20:03:56.1906778Z 2025-05-07T20:03:56.1907248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.1907915Z 2025-05-07T20:03:56.1909634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1912346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1913446Z ^ 2025-05-07T20:03:56.1913798Z 2025-05-07T20:03:56.1915429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1918033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1919073Z ^ 2025-05-07T20:03:56.1919310Z 2025-05-07T20:03:56.1919716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.1920407Z 2025-05-07T20:03:56.1922065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.1924739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.1925939Z ^ 2025-05-07T20:03:56.1926298Z 2025-05-07T20:03:56.2021796Z [408/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:03:56.2042369Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:59.9616797Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:59.9640934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9643862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9645064Z ^ 2025-05-07T20:03:59.9645319Z 2025-05-07T20:03:59.9645780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.9646456Z 2025-05-07T20:03:59.9648160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9650908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9652116Z ^ 2025-05-07T20:03:59.9652481Z 2025-05-07T20:03:59.9654226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9656933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9658135Z ^ 2025-05-07T20:03:59.9658385Z 2025-05-07T20:03:59.9658831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.9659511Z 2025-05-07T20:03:59.9661203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9663935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9665125Z ^ 2025-05-07T20:03:59.9665668Z 2025-05-07T20:03:59.9667357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9670054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9671320Z ^ 2025-05-07T20:03:59.9671642Z 2025-05-07T20:03:59.9672095Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.9672769Z 2025-05-07T20:03:59.9674593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9677314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:59.9678519Z ^ 2025-05-07T20:03:59.9678879Z 2025-05-07T20:03:59.9680546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9683236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9684405Z ^ 2025-05-07T20:03:59.9684658Z 2025-05-07T20:03:59.9685101Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.9685768Z 2025-05-07T20:03:59.9687472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9690247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9691447Z ^ 2025-05-07T20:03:59.9691811Z 2025-05-07T20:03:59.9693496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.9696182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9697358Z ^ 2025-05-07T20:03:59.9697611Z 2025-05-07T20:03:59.9698057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.9698732Z 2025-05-07T20:03:59.9700427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:59.9703173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.9704352Z ^ 2025-05-07T20:03:59.9704721Z 2025-05-07T20:04:00.2736098Z [410/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:00.2756222Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:00.8872395Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:00.8885103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8886495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8887128Z ^ 2025-05-07T20:04:00.8887276Z 2025-05-07T20:04:00.8887521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.8887892Z 2025-05-07T20:04:00.8888756Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8890158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8890791Z ^ 2025-05-07T20:04:00.8891003Z 2025-05-07T20:04:00.8891857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8893239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8893857Z ^ 2025-05-07T20:04:00.8894066Z 2025-05-07T20:04:00.8894310Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.8894664Z 2025-05-07T20:04:00.8895519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8896914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8897550Z ^ 2025-05-07T20:04:00.8897744Z 2025-05-07T20:04:00.8898587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8899971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8900596Z ^ 
2025-05-07T20:04:00.8900740Z 2025-05-07T20:04:00.8900981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.8901344Z 2025-05-07T20:04:00.8902206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8903597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8904220Z ^ 2025-05-07T20:04:00.8904415Z 2025-05-07T20:04:00.8905352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8906735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8907365Z ^ 2025-05-07T20:04:00.8907556Z 2025-05-07T20:04:00.8907815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.8908163Z 2025-05-07T20:04:00.8909087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8910481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8911115Z ^ 2025-05-07T20:04:00.8911307Z 2025-05-07T20:04:00.8912161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:00.8913544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8914330Z ^ 2025-05-07T20:04:00.8914493Z 2025-05-07T20:04:00.8914733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:00.8915082Z 2025-05-07T20:04:00.8915954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:00.8917333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:00.8918026Z ^ 2025-05-07T20:04:00.8918219Z 2025-05-07T20:04:01.3556836Z [412/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:01.3567405Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:01.9879528Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:01.9916929Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9919807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9921025Z ^ 2025-05-07T20:04:01.9921280Z 2025-05-07T20:04:01.9921749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.9922434Z 2025-05-07T20:04:01.9924152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9927255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9928719Z ^ 2025-05-07T20:04:01.9929089Z 2025-05-07T20:04:01.9930766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9933711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9934907Z ^ 2025-05-07T20:04:01.9935160Z 2025-05-07T20:04:01.9935608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.9936288Z 2025-05-07T20:04:01.9938009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9940776Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9942002Z ^ 2025-05-07T20:04:01.9942377Z 2025-05-07T20:04:01.9944093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9946831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9948037Z ^ 2025-05-07T20:04:01.9948289Z 2025-05-07T20:04:01.9948755Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.9949433Z 2025-05-07T20:04:01.9951261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9953936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9955115Z ^ 2025-05-07T20:04:01.9955496Z 2025-05-07T20:04:01.9957052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9959577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9960754Z ^ 2025-05-07T20:04:01.9961023Z 2025-05-07T20:04:01.9961475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.9962145Z 2025-05-07T20:04:01.9963861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9966519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9967639Z ^ 2025-05-07T20:04:01.9968004Z 2025-05-07T20:04:01.9969675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9972343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9973476Z ^ 2025-05-07T20:04:01.9973716Z 2025-05-07T20:04:01.9974151Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.9974820Z 2025-05-07T20:04:01.9976577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.9979152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.9980329Z ^ 2025-05-07T20:04:01.9980682Z 2025-05-07T20:04:02.5539235Z [414/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:02.5560469Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:02.5643544Z [415/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:02.5663889Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:02.9735178Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:02.9759317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9762839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9763935Z ^ 2025-05-07T20:04:02.9764177Z 2025-05-07T20:04:02.9764550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.9765136Z 2025-05-07T20:04:02.9766671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9769084Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9770224Z ^ 2025-05-07T20:04:02.9770614Z 2025-05-07T20:04:02.9772309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9774907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9776015Z ^ 2025-05-07T20:04:02.9776281Z 2025-05-07T20:04:02.9776754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.9777481Z 2025-05-07T20:04:02.9779004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9781702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9782867Z ^ 2025-05-07T20:04:02.9783266Z 2025-05-07T20:04:02.9784773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9787389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9788474Z ^ 2025-05-07T20:04:02.9788743Z 2025-05-07T20:04:02.9789212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.9789885Z 2025-05-07T20:04:02.9791612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9794486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9795720Z ^ 2025-05-07T20:04:02.9796096Z 2025-05-07T20:04:02.9797768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9800827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9802046Z ^ 2025-05-07T20:04:02.9802315Z 2025-05-07T20:04:02.9802760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.9803443Z 2025-05-07T20:04:02.9805097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9807821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9808826Z ^ 2025-05-07T20:04:02.9809172Z 2025-05-07T20:04:02.9810595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9813062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9814039Z ^ 2025-05-07T20:04:02.9814292Z 2025-05-07T20:04:02.9814764Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:04:02.9815376Z 2025-05-07T20:04:02.9816894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.9819479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.9820680Z ^ 2025-05-07T20:04:02.9820994Z 2025-05-07T20:04:03.3852812Z [417/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:03.3865016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3866419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3867049Z ^ 2025-05-07T20:04:03.3867197Z 2025-05-07T20:04:03.3867462Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.3867817Z 2025-05-07T20:04:03.3868676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3870076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3870715Z ^ 2025-05-07T20:04:03.3870914Z 2025-05-07T20:04:03.3871761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3873145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3873956Z ^ 2025-05-07T20:04:03.3874115Z 2025-05-07T20:04:03.3874356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.3874706Z 2025-05-07T20:04:03.3875582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3876971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3877611Z ^ 2025-05-07T20:04:03.3877808Z 2025-05-07T20:04:03.3878675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3880045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3880671Z ^ 2025-05-07T20:04:03.3880811Z 2025-05-07T20:04:03.3881067Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:03.3881416Z 2025-05-07T20:04:03.3882276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3883664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3884290Z ^ 2025-05-07T20:04:03.3884502Z 2025-05-07T20:04:03.3885410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3886800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3887459Z ^ 2025-05-07T20:04:03.3887614Z 2025-05-07T20:04:03.3887891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.3888243Z 2025-05-07T20:04:03.3889114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3890492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3891134Z ^ 2025-05-07T20:04:03.3891324Z 2025-05-07T20:04:03.3892186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3893549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3894176Z ^ 2025-05-07T20:04:03.3894315Z 2025-05-07T20:04:03.3894554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.3894912Z 2025-05-07T20:04:03.3895765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.3897162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.3897824Z ^ 2025-05-07T20:04:03.3898079Z 2025-05-07T20:04:03.8882381Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:03.8904808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8907517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8908659Z ^ 2025-05-07T20:04:03.8908921Z 2025-05-07T20:04:03.8909379Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:03.8910062Z 2025-05-07T20:04:03.8911723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8914557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8915748Z ^ 2025-05-07T20:04:03.8916121Z 2025-05-07T20:04:03.8917780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8920626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8921723Z ^ 2025-05-07T20:04:03.8921980Z 2025-05-07T20:04:03.8922429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.8923082Z 2025-05-07T20:04:03.8924723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8927447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8928942Z ^ 2025-05-07T20:04:03.8929308Z 2025-05-07T20:04:03.8930700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8933043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8934168Z ^ 2025-05-07T20:04:03.8934411Z 2025-05-07T20:04:03.8934844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.8935507Z 2025-05-07T20:04:03.8937120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8939841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8940910Z ^ 2025-05-07T20:04:03.8941261Z 2025-05-07T20:04:03.8943103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8945640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8946638Z ^ 2025-05-07T20:04:03.8946880Z 2025-05-07T20:04:03.8947324Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.8947973Z 2025-05-07T20:04:03.8949610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8952105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8953269Z ^ 2025-05-07T20:04:03.8953629Z 2025-05-07T20:04:03.8955455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8958033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8959166Z ^ 2025-05-07T20:04:03.8959436Z 2025-05-07T20:04:03.8959879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:03.8960753Z 2025-05-07T20:04:03.8962437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:03.8965100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:03.8966311Z ^ 2025-05-07T20:04:03.8966675Z 2025-05-07T20:04:04.7595939Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:04.7619809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7622615Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7623804Z ^ 2025-05-07T20:04:04.7624053Z 2025-05-07T20:04:04.7624502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7625187Z 2025-05-07T20:04:04.7626906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7629910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7630986Z ^ 2025-05-07T20:04:04.7631338Z 2025-05-07T20:04:04.7632940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7635716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7636904Z ^ 2025-05-07T20:04:04.7637152Z 2025-05-07T20:04:04.7637621Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7638296Z 2025-05-07T20:04:04.7640024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7642701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7643889Z ^ 2025-05-07T20:04:04.7644239Z 2025-05-07T20:04:04.7645944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7648662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7649798Z ^ 2025-05-07T20:04:04.7650048Z 2025-05-07T20:04:04.7650746Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7651412Z 2025-05-07T20:04:04.7653128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7655772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7656904Z ^ 2025-05-07T20:04:04.7657268Z 2025-05-07T20:04:04.7658859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7661580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7662649Z ^ 2025-05-07T20:04:04.7662890Z 2025-05-07T20:04:04.7663314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7663983Z 2025-05-07T20:04:04.7665463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7668169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:04.7669363Z ^ 2025-05-07T20:04:04.7669741Z 2025-05-07T20:04:04.7671424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7674466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7675637Z ^ 2025-05-07T20:04:04.7675907Z 2025-05-07T20:04:04.7676368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7677045Z 2025-05-07T20:04:04.7678723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7681437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7682599Z ^ 2025-05-07T20:04:04.7682936Z 2025-05-07T20:04:05.1098283Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:05.1122383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1124936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1126332Z ^ 2025-05-07T20:04:05.1126574Z 2025-05-07T20:04:05.1127019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.1127664Z 2025-05-07T20:04:05.1129504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1132207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1133398Z ^ 2025-05-07T20:04:05.1133762Z 2025-05-07T20:04:05.1135393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1138109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1139307Z ^ 2025-05-07T20:04:05.1139554Z 2025-05-07T20:04:05.1139991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.1140614Z 2025-05-07T20:04:05.1142245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1144935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:05.1146114Z ^ 2025-05-07T20:04:05.1146406Z 2025-05-07T20:04:05.1148262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1150924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1152198Z ^ 2025-05-07T20:04:05.1152428Z 2025-05-07T20:04:05.1152889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.1153511Z 2025-05-07T20:04:05.1155226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1157955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1159138Z ^ 2025-05-07T20:04:05.1159483Z 2025-05-07T20:04:05.1161123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1163775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1164942Z ^ 2025-05-07T20:04:05.1165200Z 2025-05-07T20:04:05.1165661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.1166348Z 2025-05-07T20:04:05.1168105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:05.1170853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1172280Z ^ 2025-05-07T20:04:05.1172638Z 2025-05-07T20:04:05.1174270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1176966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1178151Z ^ 2025-05-07T20:04:05.1178395Z 2025-05-07T20:04:05.1178848Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.1179505Z 2025-05-07T20:04:05.1181193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.1183943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.1185140Z ^ 2025-05-07T20:04:05.1185533Z 2025-05-07T20:04:05.1248574Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:05.1268507Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.2982324Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:05.3002856Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:06.1707146Z [423/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:06.1730713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1733391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1734547Z ^ 2025-05-07T20:04:06.1734811Z 2025-05-07T20:04:06.1735197Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1735800Z 2025-05-07T20:04:06.1737463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1740193Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1741420Z ^ 2025-05-07T20:04:06.1741790Z 2025-05-07T20:04:06.1743443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1746510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1747699Z ^ 2025-05-07T20:04:06.1747948Z 2025-05-07T20:04:06.1748411Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1749159Z 2025-05-07T20:04:06.1750934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1753360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1754554Z ^ 2025-05-07T20:04:06.1754909Z 2025-05-07T20:04:06.1756521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1759205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1760409Z ^ 2025-05-07T20:04:06.1760669Z 2025-05-07T20:04:06.1761120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1761799Z 2025-05-07T20:04:06.1763542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1766205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1767427Z ^ 2025-05-07T20:04:06.1768021Z 2025-05-07T20:04:06.1769648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1772278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1773476Z ^ 2025-05-07T20:04:06.1773748Z 2025-05-07T20:04:06.1774212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.1774922Z 2025-05-07T20:04:06.1776649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1779382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1780578Z ^ 2025-05-07T20:04:06.1780958Z 2025-05-07T20:04:06.1782632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1785161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1786299Z ^ 2025-05-07T20:04:06.1786556Z 2025-05-07T20:04:06.1787029Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:06.1787708Z 2025-05-07T20:04:06.1789544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.1792098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.1793229Z ^ 2025-05-07T20:04:06.1793687Z 2025-05-07T20:04:06.2620120Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:06.2640126Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:07.5897708Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:07.5918770Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:08.0755482Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:08.0766419Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:04:09.1059508Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:09.1080247Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.2006453Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:09.2026075Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.2163890Z [429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:09.2184298Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.3351878Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:10.3372535Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:04:10.4706873Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:10.4726988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.5303139Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:11.5321505Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.5547610Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:11.5570987Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.5693555Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:11.5714227Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:12.1218076Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:12.1238979Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:12.5607176Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:12.5625978Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:13.0719434Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:13.0737362Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.0835577Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:14.0854965Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.3357567Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:14.3376350Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.4366791Z [440/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:14.4387667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4390208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4391238Z ^ 2025-05-07T20:04:14.4391461Z 2025-05-07T20:04:14.4391878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4392468Z 2025-05-07T20:04:14.4394178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4396583Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4399395Z ^ 2025-05-07T20:04:14.4399718Z 2025-05-07T20:04:14.4401249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4403600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4404661Z ^ 2025-05-07T20:04:14.4404876Z 2025-05-07T20:04:14.4405273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4405881Z 2025-05-07T20:04:14.4407339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4409659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4410569Z ^ 2025-05-07T20:04:14.4410858Z 2025-05-07T20:04:14.4412178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4414424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4415409Z ^ 2025-05-07T20:04:14.4415629Z 2025-05-07T20:04:14.4415983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4416518Z 2025-05-07T20:04:14.4417986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4420099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4421201Z ^ 2025-05-07T20:04:14.4421495Z 2025-05-07T20:04:14.4422939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4425208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4426241Z ^ 2025-05-07T20:04:14.4426444Z 2025-05-07T20:04:14.4426799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4427346Z 2025-05-07T20:04:14.4428977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4431119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4432060Z ^ 2025-05-07T20:04:14.4432368Z 2025-05-07T20:04:14.4433734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4436001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4437073Z ^ 2025-05-07T20:04:14.4437277Z 2025-05-07T20:04:14.4437659Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:14.4438206Z 2025-05-07T20:04:14.4439629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4441832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4442730Z ^ 2025-05-07T20:04:14.4443022Z 2025-05-07T20:04:14.4512958Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:14.4529047Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.4661062Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:14.4677819Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.5698392Z [443/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:14.5720694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5723367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5724541Z ^ 2025-05-07T20:04:14.5724784Z 2025-05-07T20:04:14.5725176Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.5725719Z 2025-05-07T20:04:14.5727252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5730021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5731090Z ^ 2025-05-07T20:04:14.5731399Z 2025-05-07T20:04:14.5732718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5734839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5735766Z ^ 2025-05-07T20:04:14.5735992Z 2025-05-07T20:04:14.5736371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.5736935Z 2025-05-07T20:04:14.5738708Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5741125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5742359Z ^ 2025-05-07T20:04:14.5742684Z 2025-05-07T20:04:14.5744249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5746675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5747629Z ^ 2025-05-07T20:04:14.5747856Z 2025-05-07T20:04:14.5748285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.5748889Z 2025-05-07T20:04:14.5750415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5752821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5754067Z ^ 2025-05-07T20:04:14.5754381Z 2025-05-07T20:04:14.5755780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5757998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5759148Z ^ 
2025-05-07T20:04:14.5759380Z 2025-05-07T20:04:14.5759768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.5760359Z 2025-05-07T20:04:14.5761791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5764105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5765138Z ^ 2025-05-07T20:04:14.5765456Z 2025-05-07T20:04:14.5766953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5769306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5770397Z ^ 2025-05-07T20:04:14.5770625Z 2025-05-07T20:04:14.5771050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.5771626Z 2025-05-07T20:04:14.5773086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.5775463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.5776461Z ^ 2025-05-07T20:04:14.5776789Z 2025-05-07T20:04:14.7301527Z [444/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:14.7323233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7325740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7326784Z ^ 2025-05-07T20:04:14.7327053Z 2025-05-07T20:04:14.7327471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.7328121Z 2025-05-07T20:04:14.7329941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7332412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7333419Z ^ 2025-05-07T20:04:14.7333756Z 2025-05-07T20:04:14.7335124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7337786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:14.7338759Z ^ 2025-05-07T20:04:14.7339022Z 2025-05-07T20:04:14.7339439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.7340148Z 2025-05-07T20:04:14.7341792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7344179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7345238Z ^ 2025-05-07T20:04:14.7345539Z 2025-05-07T20:04:14.7347076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7349632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7350690Z ^ 2025-05-07T20:04:14.7350949Z 2025-05-07T20:04:14.7351369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.7352011Z 2025-05-07T20:04:14.7353595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7356152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7357277Z ^ 2025-05-07T20:04:14.7357612Z 2025-05-07T20:04:14.7359381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:04:14.7361802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7362894Z ^ 2025-05-07T20:04:14.7363115Z 2025-05-07T20:04:14.7363556Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.7364134Z 2025-05-07T20:04:14.7365645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7368073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7369183Z ^ 2025-05-07T20:04:14.7369510Z 2025-05-07T20:04:14.7371032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7373482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7374435Z ^ 2025-05-07T20:04:14.7374646Z 2025-05-07T20:04:14.7375027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.7375609Z 2025-05-07T20:04:14.7377246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.7379666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.7380772Z ^ 2025-05-07T20:04:14.7381116Z 2025-05-07T20:04:15.2487583Z [445/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:15.2507398Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.3153763Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:15.3172093Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.8497965Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:15.8515862Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:16.9937779Z [448/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:16.9959574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9962026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9963060Z ^ 2025-05-07T20:04:16.9963319Z 2025-05-07T20:04:16.9963721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:16.9964302Z 2025-05-07T20:04:16.9965796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9968129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9969148Z ^ 2025-05-07T20:04:16.9969454Z 2025-05-07T20:04:16.9970865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9973293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9974336Z ^ 2025-05-07T20:04:16.9974552Z 2025-05-07T20:04:16.9974935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:16.9975511Z 2025-05-07T20:04:16.9977229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9979646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9980662Z ^ 2025-05-07T20:04:16.9981091Z 2025-05-07T20:04:16.9982538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9985025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9985993Z ^ 2025-05-07T20:04:16.9986223Z 2025-05-07T20:04:16.9986598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:16.9987158Z 2025-05-07T20:04:16.9988589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9990834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9991908Z ^ 2025-05-07T20:04:16.9992236Z 2025-05-07T20:04:16.9993731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:16.9996325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:16.9997387Z ^ 2025-05-07T20:04:16.9997616Z 2025-05-07T20:04:16.9998168Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:16.9998746Z 2025-05-07T20:04:17.0000166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.0002502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.0003526Z ^ 2025-05-07T20:04:17.0003876Z 2025-05-07T20:04:17.0005365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.0007766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.0008804Z ^ 2025-05-07T20:04:17.0009041Z 2025-05-07T20:04:17.0009438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.0010035Z 2025-05-07T20:04:17.0011518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.0013926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.0014982Z ^ 2025-05-07T20:04:17.0015300Z 2025-05-07T20:04:17.0080243Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:17.1538302Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.1555380Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:17.1573403Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.1490206Z [451/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:18.1509950Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.3025672Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:18.3045266Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.4237908Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 
2025-05-07T20:04:18.4257933Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.6037083Z [454/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:18.6055299Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.3960086Z [455/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:19.3982004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.3984804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.3985990Z ^ 
2025-05-07T20:04:19.3986207Z
2025-05-07T20:04:19.3986588Z Remark: The warnings can be suppressed with "-diag-suppress "
2025-05-07T20:04:19.3987141Z
2025-05-07T20:04:19.3988434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.3990617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.3991958Z ^
2025-05-07T20:04:19.3992259Z
2025-05-07T20:04:19.3993689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.3996144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.3997156Z ^
2025-05-07T20:04:19.3997369Z
2025-05-07T20:04:19.3997765Z Remark: The warnings can be suppressed with "-diag-suppress "
2025-05-07T20:04:19.3998327Z
2025-05-07T20:04:19.3999722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.4002053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.4003060Z ^
2025-05-07T20:04:19.4003369Z
2025-05-07T20:04:19.4004916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.4007246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.4008303Z ^
2025-05-07T20:04:19.4008563Z
2025-05-07T20:04:19.4008995Z Remark: The warnings can be suppressed with "-diag-suppress "
2025-05-07T20:04:19.4009577Z
2025-05-07T20:04:19.4011275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.4013543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.4014574Z ^
2025-05-07T20:04:19.4014863Z
2025-05-07T20:04:19.4016276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.4018516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.4019509Z ^
2025-05-07T20:04:19.4019724Z
2025-05-07T20:04:19.4020114Z Remark: The warnings can be suppressed with "-diag-suppress "
2025-05-07T20:04:19.4020733Z
2025-05-07T20:04:19.4022205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration
2025-05-07T20:04:19.4024663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default;
2025-05-07T20:04:19.4025531Z ^
2025-05-07T20:04:19.4025863Z
2025-05-07T20:04:19.4027325Z
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.4030141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.4031116Z ^ 2025-05-07T20:04:19.4031604Z 2025-05-07T20:04:19.4031962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.4032496Z 2025-05-07T20:04:19.4034008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.4036280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.4037410Z ^ 2025-05-07T20:04:19.4037761Z 2025-05-07T20:04:20.7990373Z [456/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:20.8011545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8013866Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8014925Z ^ 2025-05-07T20:04:20.8015159Z 2025-05-07T20:04:20.8015572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8016188Z 2025-05-07T20:04:20.8017713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8020509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8021604Z ^ 2025-05-07T20:04:20.8021942Z 2025-05-07T20:04:20.8023544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8026007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8027100Z ^ 2025-05-07T20:04:20.8027342Z 2025-05-07T20:04:20.8027751Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8028672Z 2025-05-07T20:04:20.8030208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8032637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8033730Z ^ 2025-05-07T20:04:20.8034147Z 2025-05-07T20:04:20.8035595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8037851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8039214Z ^ 2025-05-07T20:04:20.8039460Z 2025-05-07T20:04:20.8039856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8040476Z 2025-05-07T20:04:20.8042022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8044716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8045850Z ^ 2025-05-07T20:04:20.8046192Z 2025-05-07T20:04:20.8047759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8050333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8051455Z ^ 2025-05-07T20:04:20.8051697Z 2025-05-07T20:04:20.8052135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8052817Z 2025-05-07T20:04:20.8054478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8057182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:20.8058355Z ^ 2025-05-07T20:04:20.8058731Z 2025-05-07T20:04:20.8060395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8063225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8064382Z ^ 2025-05-07T20:04:20.8064644Z 2025-05-07T20:04:20.8065068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:20.8065603Z 2025-05-07T20:04:20.8067181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:20.8069722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:20.8070792Z ^ 2025-05-07T20:04:20.8071095Z 2025-05-07T20:04:22.3506914Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:22.3525179Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:23.3868929Z [458/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:23.3892376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3895301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3896372Z ^ 2025-05-07T20:04:23.3896618Z 2025-05-07T20:04:23.3897031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.3897834Z 2025-05-07T20:04:23.3899524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3902278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3903479Z ^ 2025-05-07T20:04:23.3903847Z 2025-05-07T20:04:23.3905511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3908080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3909245Z ^ 2025-05-07T20:04:23.3909492Z 2025-05-07T20:04:23.3909938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.3910611Z 2025-05-07T20:04:23.3912350Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3915266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3916434Z ^ 2025-05-07T20:04:23.3916926Z 2025-05-07T20:04:23.3918432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3920781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3921809Z ^ 2025-05-07T20:04:23.3922058Z 2025-05-07T20:04:23.3922462Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.3923084Z 2025-05-07T20:04:23.3924624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3927109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3928232Z ^ 2025-05-07T20:04:23.3928921Z 2025-05-07T20:04:23.3930411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3932991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3934164Z ^ 
2025-05-07T20:04:23.3934419Z 2025-05-07T20:04:23.3934873Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.3935562Z 2025-05-07T20:04:23.3937462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3940023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3941195Z ^ 2025-05-07T20:04:23.3941676Z 2025-05-07T20:04:23.3943268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3945633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3946713Z ^ 2025-05-07T20:04:23.3946952Z 2025-05-07T20:04:23.3947357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.3948015Z 2025-05-07T20:04:23.3949623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.3952120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.3953252Z ^ 2025-05-07T20:04:23.3953599Z 2025-05-07T20:04:23.7531015Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:23.7549143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:24.3647080Z [460/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:24.3666993Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:24.5222848Z [461/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:24.5241925Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:25.0128657Z [462/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:25.9362020Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:25.9379572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.0253824Z [464/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:26.0278515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0281298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0282475Z ^ 2025-05-07T20:04:26.0282737Z 2025-05-07T20:04:26.0283177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0283757Z 2025-05-07T20:04:26.0285755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0288251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0289328Z ^ 2025-05-07T20:04:26.0289808Z 2025-05-07T20:04:26.0291501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0294109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0295183Z ^ 2025-05-07T20:04:26.0295423Z 2025-05-07T20:04:26.0295850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0296476Z 
2025-05-07T20:04:26.0298057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0300728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0301891Z ^ 2025-05-07T20:04:26.0302278Z 2025-05-07T20:04:26.0303829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0306415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0307573Z ^ 2025-05-07T20:04:26.0307975Z 2025-05-07T20:04:26.0308424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0309075Z 2025-05-07T20:04:26.0310693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0312954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0314201Z ^ 2025-05-07T20:04:26.0314548Z 2025-05-07T20:04:26.0316160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0318599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:26.0319726Z ^ 2025-05-07T20:04:26.0319955Z 2025-05-07T20:04:26.0320343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0320981Z 2025-05-07T20:04:26.0322670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0325072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0326139Z ^ 2025-05-07T20:04:26.0326500Z 2025-05-07T20:04:26.0328175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0330998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0332047Z ^ 2025-05-07T20:04:26.0332477Z 2025-05-07T20:04:26.0332905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0333533Z 2025-05-07T20:04:26.0335304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0338019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0339197Z ^ 2025-05-07T20:04:26.0339560Z 2025-05-07T20:04:26.8600030Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:26.8619018Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:27.7253951Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:27.7273396Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.8039474Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:28.8057724Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.0771520Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:29.0791006Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.8694406Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:29.8712685Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.4580505Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:30.4600495Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:04:31.1218563Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:31.1237396Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.3981306Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG 
-std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:31.4000381Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.4918566Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:31.4936833Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.1425789Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:32.1443259Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.6961168Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:32.6978714Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.7515585Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:32.7533550Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.2515688Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:33.2532790Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.6270923Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:33.6288174Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.0270219Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:34.0289024Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.5707065Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:35.5723984Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.9912774Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:04:35.9930804Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:37.1249370Z [482/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:04:37.1269420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1271791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1272766Z ^ 2025-05-07T20:04:37.1273303Z 2025-05-07T20:04:37.1273680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:37.1274463Z 2025-05-07T20:04:37.1275991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1278414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1292846Z ^ 2025-05-07T20:04:37.1293228Z 2025-05-07T20:04:37.1294751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1297206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1298276Z ^ 2025-05-07T20:04:37.1298512Z 2025-05-07T20:04:37.1298850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:37.1299296Z 2025-05-07T20:04:37.1300666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1302755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1303768Z ^ 2025-05-07T20:04:37.1304078Z 2025-05-07T20:04:37.1305587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1307563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1308366Z ^ 2025-05-07T20:04:37.1308659Z 2025-05-07T20:04:37.1309016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:37.1309621Z 2025-05-07T20:04:37.1311193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1313298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1314461Z ^ 2025-05-07T20:04:37.1314745Z 2025-05-07T20:04:37.1316006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1318188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1319172Z ^ 2025-05-07T20:04:37.1319411Z 2025-05-07T20:04:37.1319790Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:37.1320340Z 2025-05-07T20:04:37.1321764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1324019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1325194Z ^ 2025-05-07T20:04:37.1325524Z 2025-05-07T20:04:37.1327003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1329602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1330638Z ^ 2025-05-07T20:04:37.1330870Z 2025-05-07T20:04:37.1331281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:37.1331920Z 2025-05-07T20:04:37.1333346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:37.1335702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:37.1336718Z ^ 2025-05-07T20:04:37.1337068Z 2025-05-07T20:04:40.1641305Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:04:40.1658760Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.0766422Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:04:41.0784986Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.7271535Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:04:41.7289708Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.0972321Z [486/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:04:42.0987973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.2433019Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:04:42.2451118Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.2823982Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:04:42.2842860Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.7383692Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:04:42.7404590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:42.7406880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7407977Z ^ 2025-05-07T20:04:42.7408264Z 2025-05-07T20:04:42.7408657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.7409230Z 2025-05-07T20:04:42.7410758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7412981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7413861Z ^ 2025-05-07T20:04:42.7414200Z 2025-05-07T20:04:42.7415889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7418277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7419469Z ^ 2025-05-07T20:04:42.7419698Z 2025-05-07T20:04:42.7420210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.7420835Z 2025-05-07T20:04:42.7422357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7424859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7425941Z ^ 2025-05-07T20:04:42.7426258Z 2025-05-07T20:04:42.7427746Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7430436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7431506Z ^ 2025-05-07T20:04:42.7431741Z 2025-05-07T20:04:42.7432150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.7432757Z 2025-05-07T20:04:42.7434312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7436790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7437819Z ^ 2025-05-07T20:04:42.7438110Z 2025-05-07T20:04:42.7439506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7441885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7442941Z ^ 2025-05-07T20:04:42.7443149Z 2025-05-07T20:04:42.7443523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.7444060Z 2025-05-07T20:04:42.7445520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7447885Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7448923Z ^ 2025-05-07T20:04:42.7449256Z 2025-05-07T20:04:42.7450703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7453078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7454127Z ^ 2025-05-07T20:04:42.7454374Z 2025-05-07T20:04:42.7454786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.7455629Z 2025-05-07T20:04:42.7457113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.7459767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.7460903Z ^ 2025-05-07T20:04:42.7461333Z 2025-05-07T20:04:42.7686463Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:04:42.7702923Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:43.9027943Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:04:43.9046513Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.1877778Z [492/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:04:44.1901078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1903781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1904965Z ^ 2025-05-07T20:04:44.1905278Z 2025-05-07T20:04:44.1905733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.1906618Z 2025-05-07T20:04:44.1908249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1911044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1912238Z ^ 2025-05-07T20:04:44.1912739Z 2025-05-07T20:04:44.1914558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1917126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:44.1918242Z ^ 2025-05-07T20:04:44.1918506Z 2025-05-07T20:04:44.1918955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.1919573Z 2025-05-07T20:04:44.1921032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1923744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1924876Z ^ 2025-05-07T20:04:44.1925240Z 2025-05-07T20:04:44.1926919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1929750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1931292Z ^ 2025-05-07T20:04:44.1931553Z 2025-05-07T20:04:44.1931995Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.1932672Z 2025-05-07T20:04:44.1934386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1936993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1938035Z ^ 2025-05-07T20:04:44.1938382Z 2025-05-07T20:04:44.1939900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:44.1942452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1943528Z ^ 2025-05-07T20:04:44.1943848Z 2025-05-07T20:04:44.1944279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.1944918Z 2025-05-07T20:04:44.1946560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1949202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1950352Z ^ 2025-05-07T20:04:44.1950676Z 2025-05-07T20:04:44.1952351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1955092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1956311Z ^ 2025-05-07T20:04:44.1956552Z 2025-05-07T20:04:44.1957052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.1957617Z 2025-05-07T20:04:44.1959189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.1961699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.1962826Z ^ 2025-05-07T20:04:44.1963198Z 2025-05-07T20:04:44.6589592Z [493/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:44.6608192Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.2750317Z [494/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:04:45.2773624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2776369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2777534Z ^ 2025-05-07T20:04:45.2777761Z 2025-05-07T20:04:45.2778213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.2778842Z 2025-05-07T20:04:45.2780471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2783090Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2784267Z ^ 2025-05-07T20:04:45.2784613Z 2025-05-07T20:04:45.2786322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2788935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2790090Z ^ 2025-05-07T20:04:45.2790341Z 2025-05-07T20:04:45.2790780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.2791444Z 2025-05-07T20:04:45.2794608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2797360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2798530Z ^ 2025-05-07T20:04:45.2798999Z 2025-05-07T20:04:45.2800590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2803030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2804128Z ^ 2025-05-07T20:04:45.2804371Z 2025-05-07T20:04:45.2804803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.2805338Z 2025-05-07T20:04:45.2806812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2809312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2810422Z ^ 2025-05-07T20:04:45.2810755Z 2025-05-07T20:04:45.2812289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2814825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2815971Z ^ 2025-05-07T20:04:45.2816205Z 2025-05-07T20:04:45.2816638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.2817486Z 2025-05-07T20:04:45.2818952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2821656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2822839Z ^ 2025-05-07T20:04:45.2823174Z 2025-05-07T20:04:45.2824767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2827309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2828769Z ^ 2025-05-07T20:04:45.2829047Z 2025-05-07T20:04:45.2829487Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:04:45.2830125Z 2025-05-07T20:04:45.2831759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.2834469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.2835651Z ^ 2025-05-07T20:04:45.2835996Z 2025-05-07T20:04:45.3688145Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:04:45.3705782Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.6558238Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:04:45.6578629Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:46.5832292Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:46.5850800Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.0042978Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:04:47.0062079Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:48.5791987Z [499/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:04:48.5811805Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:50.7201617Z [500/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:50.7225655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7228657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7230049Z ^ 2025-05-07T20:04:50.7230305Z 2025-05-07T20:04:50.7230746Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.7231410Z 2025-05-07T20:04:50.7233131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:50.7235912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7236985Z ^ 2025-05-07T20:04:50.7237284Z 2025-05-07T20:04:50.7238783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7241284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7242362Z ^ 2025-05-07T20:04:50.7242600Z 2025-05-07T20:04:50.7243048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.7243704Z 2025-05-07T20:04:50.7245470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7248233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7249450Z ^ 2025-05-07T20:04:50.7249815Z 2025-05-07T20:04:50.7251657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7254316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7255575Z ^ 2025-05-07T20:04:50.7255821Z 2025-05-07T20:04:50.7256331Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.7256969Z 2025-05-07T20:04:50.7258711Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7261425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7262652Z ^ 2025-05-07T20:04:50.7263026Z 2025-05-07T20:04:50.7264651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7267322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7268489Z ^ 2025-05-07T20:04:50.7268733Z 2025-05-07T20:04:50.7269179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.7269839Z 2025-05-07T20:04:50.7271300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7274049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7275164Z ^ 2025-05-07T20:04:50.7275515Z 2025-05-07T20:04:50.7277185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7279702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7280744Z ^ 
2025-05-07T20:04:50.7280981Z 2025-05-07T20:04:50.7281383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.7281974Z 2025-05-07T20:04:50.7283528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.7286092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.7287365Z ^ 2025-05-07T20:04:50.7287731Z 2025-05-07T20:04:50.7937090Z [501/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:04:50.7955434Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:54.9598815Z [502/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:04:54.9619719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9622531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9623523Z ^ 
2025-05-07T20:04:54.9623729Z 2025-05-07T20:04:54.9624138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9624719Z 2025-05-07T20:04:54.9626192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9628944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9629988Z ^ 2025-05-07T20:04:54.9630312Z 2025-05-07T20:04:54.9631684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9634121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9635106Z ^ 2025-05-07T20:04:54.9635354Z 2025-05-07T20:04:54.9635723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9636297Z 2025-05-07T20:04:54.9637844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9640546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9641663Z ^ 2025-05-07T20:04:54.9642002Z 2025-05-07T20:04:54.9643552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:54.9646090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9647110Z ^ 2025-05-07T20:04:54.9647318Z 2025-05-07T20:04:54.9647698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9648258Z 2025-05-07T20:04:54.9649714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9652147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9653236Z ^ 2025-05-07T20:04:54.9653589Z 2025-05-07T20:04:54.9655069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9657460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9658741Z ^ 2025-05-07T20:04:54.9659008Z 2025-05-07T20:04:54.9659412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9660063Z 2025-05-07T20:04:54.9661659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9664360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9665515Z ^ 2025-05-07T20:04:54.9665836Z 2025-05-07T20:04:54.9667270Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9669626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9670677Z ^ 2025-05-07T20:04:54.9670910Z 2025-05-07T20:04:54.9671326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9671960Z 2025-05-07T20:04:54.9673477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9676080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9677135Z ^ 2025-05-07T20:04:54.9677493Z 2025-05-07T20:04:57.4002278Z [503/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:04:57.4018375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:00.9519872Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:00.9543364Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9546080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9547273Z ^ 2025-05-07T20:05:00.9547535Z 2025-05-07T20:05:00.9547978Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.9548592Z 2025-05-07T20:05:00.9550216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9552895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9554198Z ^ 2025-05-07T20:05:00.9554563Z 2025-05-07T20:05:00.9556265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9559218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9560418Z ^ 2025-05-07T20:05:00.9560670Z 2025-05-07T20:05:00.9561124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.9561844Z 2025-05-07T20:05:00.9563561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9566241Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9567427Z ^ 2025-05-07T20:05:00.9567798Z 2025-05-07T20:05:00.9569281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9571772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9572872Z ^ 2025-05-07T20:05:00.9573135Z 2025-05-07T20:05:00.9573581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.9574162Z 2025-05-07T20:05:00.9575800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9578525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9579677Z ^ 2025-05-07T20:05:00.9580172Z 2025-05-07T20:05:00.9581745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9584130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9585302Z ^ 2025-05-07T20:05:00.9585551Z 2025-05-07T20:05:00.9586006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.9586688Z 2025-05-07T20:05:00.9588379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9591125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9592312Z ^ 2025-05-07T20:05:00.9592687Z 2025-05-07T20:05:00.9594490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9597191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9598363Z ^ 2025-05-07T20:05:00.9598610Z 2025-05-07T20:05:00.9599075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.9599742Z 2025-05-07T20:05:00.9601601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.9604279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.9605471Z ^ 2025-05-07T20:05:00.9605901Z 2025-05-07T20:05:01.3139371Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:01.3160595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3162543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3163406Z ^ 2025-05-07T20:05:01.3163592Z 2025-05-07T20:05:01.3163921Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.3164411Z 2025-05-07T20:05:01.3165596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3167795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3168648Z ^ 2025-05-07T20:05:01.3168914Z 2025-05-07T20:05:01.3170091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3173159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3174161Z ^ 2025-05-07T20:05:01.3174389Z 2025-05-07T20:05:01.3174807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.3175386Z 
2025-05-07T20:05:01.3176731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3179049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3180068Z ^ 2025-05-07T20:05:01.3180396Z 2025-05-07T20:05:01.3181754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3184036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3185091Z ^ 2025-05-07T20:05:01.3185335Z 2025-05-07T20:05:01.3185748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.3186337Z 2025-05-07T20:05:01.3187882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3190451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3191566Z ^ 2025-05-07T20:05:01.3191872Z 2025-05-07T20:05:01.3193368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3195890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:05:01.3196885Z ^ 2025-05-07T20:05:01.3197094Z 2025-05-07T20:05:01.3197481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.3198115Z 2025-05-07T20:05:01.3199669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3202171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3203275Z ^ 2025-05-07T20:05:01.3203621Z 2025-05-07T20:05:01.3205175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3207593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3208802Z ^ 2025-05-07T20:05:01.3209031Z 2025-05-07T20:05:01.3209445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.3210019Z 2025-05-07T20:05:01.3211512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.3213959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.3215039Z ^ 2025-05-07T20:05:01.3215295Z 2025-05-07T20:05:02.5134992Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:02.5156715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5159154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5160267Z ^ 2025-05-07T20:05:02.5160507Z 2025-05-07T20:05:02.5160913Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.5161515Z 2025-05-07T20:05:02.5163333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5165794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5167035Z ^ 2025-05-07T20:05:02.5167396Z 2025-05-07T20:05:02.5169067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5171481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5172470Z ^ 
2025-05-07T20:05:02.5172696Z 2025-05-07T20:05:02.5173169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.5173812Z 2025-05-07T20:05:02.5175297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5177780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5178956Z ^ 2025-05-07T20:05:02.5179265Z 2025-05-07T20:05:02.5180416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.5182036Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.5182536Z ^ 2025-05-07T20:05:02.5182789Z 2025-05-07T20:05:02.5184314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5186944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5187984Z ^ 2025-05-07T20:05:02.5188223Z 2025-05-07T20:05:02.5188663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.5189272Z 2025-05-07T20:05:02.5190773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5193138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5194425Z ^ 2025-05-07T20:05:02.5194777Z 2025-05-07T20:05:02.5196003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.5197645Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.5198141Z ^ 2025-05-07T20:05:02.5198373Z 2025-05-07T20:05:02.5199864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5202329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5203390Z ^ 2025-05-07T20:05:02.5203618Z 2025-05-07T20:05:02.5204248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.5204862Z 2025-05-07T20:05:02.5206282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5208999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5210030Z ^ 2025-05-07T20:05:02.5210386Z 2025-05-07T20:05:02.5211672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.5213338Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.5213807Z ^ 2025-05-07T20:05:02.5214040Z 2025-05-07T20:05:02.5215502Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5217954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5219037Z ^ 2025-05-07T20:05:02.5219271Z 2025-05-07T20:05:02.5219681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.5220304Z 2025-05-07T20:05:02.5221839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:02.5224334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:02.5225555Z ^ 2025-05-07T20:05:02.5225901Z 2025-05-07T20:05:02.5227060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.5229008Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.5229457Z ^ 2025-05-07T20:05:02.5229695Z 2025-05-07T20:05:05.3689990Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:05.3702004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3703398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3704019Z ^ 2025-05-07T20:05:05.3704160Z 2025-05-07T20:05:05.3704403Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:05.3704768Z 2025-05-07T20:05:05.3705620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3707077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3707697Z ^ 2025-05-07T20:05:05.3707901Z 2025-05-07T20:05:05.3708743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3710113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3710720Z ^ 2025-05-07T20:05:05.3710856Z 2025-05-07T20:05:05.3711102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:05.3711449Z 2025-05-07T20:05:05.3712354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3713732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3714517Z ^ 2025-05-07T20:05:05.3714711Z 2025-05-07T20:05:05.3715575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3716942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3717560Z ^ 2025-05-07T20:05:05.3717697Z 2025-05-07T20:05:05.3717979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:05.3718341Z 2025-05-07T20:05:05.3719194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3720609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3721265Z ^ 2025-05-07T20:05:05.3721470Z 2025-05-07T20:05:05.3722312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3723679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3724284Z ^ 2025-05-07T20:05:05.3724432Z 2025-05-07T20:05:05.3724670Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:05:05.3725016Z 2025-05-07T20:05:05.3725863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3727246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3727876Z ^ 2025-05-07T20:05:05.3728067Z 2025-05-07T20:05:05.3729178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3730561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3731282Z ^ 2025-05-07T20:05:05.3731417Z 2025-05-07T20:05:05.3731653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:05.3732016Z 2025-05-07T20:05:05.3732870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:05.3734257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:05.3734870Z ^ 2025-05-07T20:05:05.3735059Z 2025-05-07T20:05:10.3773919Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:10.3796199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3798623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3799641Z ^ 2025-05-07T20:05:10.3799867Z 2025-05-07T20:05:10.3800304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:10.3801268Z 2025-05-07T20:05:10.3802696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3805124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3806214Z ^ 2025-05-07T20:05:10.3806555Z 2025-05-07T20:05:10.3808062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3810475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3811525Z ^ 2025-05-07T20:05:10.3811731Z 2025-05-07T20:05:10.3812111Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:05:10.3812711Z 2025-05-07T20:05:10.3814117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3816369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3817400Z ^ 2025-05-07T20:05:10.3817721Z 2025-05-07T20:05:10.3818939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:10.3820553Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:10.3821239Z ^ 2025-05-07T20:05:10.3821471Z 2025-05-07T20:05:10.3822926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3825529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3826687Z ^ 2025-05-07T20:05:10.3826915Z 2025-05-07T20:05:10.3827305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:10.3827886Z 2025-05-07T20:05:10.3829602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3832079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3833085Z ^ 2025-05-07T20:05:10.3833419Z 
2025-05-07T20:05:10.3834787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:10.3836436Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:10.3836904Z ^ 2025-05-07T20:05:10.3837150Z 2025-05-07T20:05:10.3838651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3841050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3842368Z ^ 2025-05-07T20:05:10.3842609Z 2025-05-07T20:05:10.3843016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:10.3843629Z 2025-05-07T20:05:10.3845084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3847490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3848509Z ^ 2025-05-07T20:05:10.3848810Z 2025-05-07T20:05:10.3850081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:10.3851732Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:10.3852194Z ^ 2025-05-07T20:05:10.3852445Z 2025-05-07T20:05:10.3853921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3856395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3857401Z ^ 2025-05-07T20:05:10.3857638Z 2025-05-07T20:05:10.3858057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:10.3858664Z 2025-05-07T20:05:10.3863352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:10.3865842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:10.3866948Z ^ 2025-05-07T20:05:10.3867280Z 2025-05-07T20:05:10.3868843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:10.3870485Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:10.3870969Z ^ 2025-05-07T20:05:10.3871187Z 2025-05-07T20:05:11.6608766Z [509/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:05:13.4203236Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:13.4225303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:05:13.4227817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4229240Z ^ 2025-05-07T20:05:13.4229554Z 2025-05-07T20:05:13.4230015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.4230639Z 2025-05-07T20:05:13.4232269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4234745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4235707Z ^ 2025-05-07T20:05:13.4236028Z 2025-05-07T20:05:13.4237704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4239965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4241041Z ^ 2025-05-07T20:05:13.4241275Z 2025-05-07T20:05:13.4241799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.4242405Z 2025-05-07T20:05:13.4243983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4246291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4247326Z ^ 2025-05-07T20:05:13.4247671Z 2025-05-07T20:05:13.4249182Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4251608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4252699Z ^ 2025-05-07T20:05:13.4252932Z 2025-05-07T20:05:13.4253350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.4253939Z 2025-05-07T20:05:13.4255356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4257636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4258891Z ^ 2025-05-07T20:05:13.4259240Z 2025-05-07T20:05:13.4260787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4263292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4264320Z ^ 2025-05-07T20:05:13.4264544Z 2025-05-07T20:05:13.4264957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.4265531Z 2025-05-07T20:05:13.4267028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4269518Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4270667Z ^ 2025-05-07T20:05:13.4271015Z 2025-05-07T20:05:13.4272579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4275233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4276246Z ^ 2025-05-07T20:05:13.4276473Z 2025-05-07T20:05:13.4276873Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.4277473Z 2025-05-07T20:05:13.4279126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.4281688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.4282976Z ^ 2025-05-07T20:05:13.4283322Z 2025-05-07T20:05:13.8390744Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:13.8411632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8414104Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8415178Z ^ 2025-05-07T20:05:13.8415445Z 2025-05-07T20:05:13.8415858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.8416474Z 2025-05-07T20:05:13.8418019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8420503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8421595Z ^ 2025-05-07T20:05:13.8422249Z 2025-05-07T20:05:13.8423775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8426386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8427476Z ^ 2025-05-07T20:05:13.8427848Z 2025-05-07T20:05:13.8428301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.8429229Z 2025-05-07T20:05:13.8430661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8433180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8434421Z ^ 
2025-05-07T20:05:13.8434785Z 2025-05-07T20:05:13.8436294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8438847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8439925Z ^ 2025-05-07T20:05:13.8440184Z 2025-05-07T20:05:13.8440603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.8441237Z 2025-05-07T20:05:13.8442778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8445523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8446627Z ^ 2025-05-07T20:05:13.8446990Z 2025-05-07T20:05:13.8448528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8451025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8452128Z ^ 2025-05-07T20:05:13.8452371Z 2025-05-07T20:05:13.8452784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.8453421Z 2025-05-07T20:05:13.8454953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:13.8457406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8458480Z ^ 2025-05-07T20:05:13.8458840Z 2025-05-07T20:05:13.8460382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8462789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8463837Z ^ 2025-05-07T20:05:13.8464084Z 2025-05-07T20:05:13.8464721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:13.8465302Z 2025-05-07T20:05:13.8466703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:13.8469430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:13.8470551Z ^ 2025-05-07T20:05:13.8470877Z 2025-05-07T20:05:17.5989486Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:17.6011969Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6014616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6015766Z ^ 2025-05-07T20:05:17.6016009Z 2025-05-07T20:05:17.6016452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.6017096Z 2025-05-07T20:05:17.6020631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6023356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6024371Z ^ 2025-05-07T20:05:17.6024696Z 2025-05-07T20:05:17.6026377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6029228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6030353Z ^ 2025-05-07T20:05:17.6030588Z 2025-05-07T20:05:17.6030980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.6031602Z 2025-05-07T20:05:17.6033225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6035986Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6037144Z ^ 2025-05-07T20:05:17.6037491Z 2025-05-07T20:05:17.6039108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6041691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6042842Z ^ 2025-05-07T20:05:17.6043079Z 2025-05-07T20:05:17.6043515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.6044408Z 2025-05-07T20:05:17.6046027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6048603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6049740Z ^ 2025-05-07T20:05:17.6050103Z 2025-05-07T20:05:17.6051727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6054327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6055464Z ^ 2025-05-07T20:05:17.6055721Z 2025-05-07T20:05:17.6056146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.6056793Z 2025-05-07T20:05:17.6058407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6061048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6062157Z ^ 2025-05-07T20:05:17.6062491Z 2025-05-07T20:05:17.6064099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6066858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6067976Z ^ 2025-05-07T20:05:17.6068206Z 2025-05-07T20:05:17.6068623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.6069403Z 2025-05-07T20:05:17.6071277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:17.6074102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:17.6075230Z ^ 2025-05-07T20:05:17.6075589Z 2025-05-07T20:05:20.9916828Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:20.9937112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9939529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9940676Z ^ 2025-05-07T20:05:20.9940903Z 2025-05-07T20:05:20.9941321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.9941996Z 2025-05-07T20:05:20.9943844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9946190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9947338Z ^ 2025-05-07T20:05:20.9947801Z 2025-05-07T20:05:20.9949182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9951398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9952436Z ^ 2025-05-07T20:05:20.9952676Z 2025-05-07T20:05:20.9953073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.9953672Z 2025-05-07T20:05:20.9955225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:05:20.9961500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9962511Z ^ 2025-05-07T20:05:20.9962841Z 2025-05-07T20:05:20.9964398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9966722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9967759Z ^ 2025-05-07T20:05:20.9968001Z 2025-05-07T20:05:20.9968314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.9968864Z 2025-05-07T20:05:20.9970387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9972825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9973851Z ^ 2025-05-07T20:05:20.9974191Z 2025-05-07T20:05:20.9975645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9977940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9979085Z ^ 2025-05-07T20:05:20.9979335Z 2025-05-07T20:05:20.9979767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.9980379Z 2025-05-07T20:05:20.9982022Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9984296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9985436Z ^ 2025-05-07T20:05:20.9985786Z 2025-05-07T20:05:20.9987551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9990141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9991368Z ^ 2025-05-07T20:05:20.9991616Z 2025-05-07T20:05:20.9992155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.9992722Z 2025-05-07T20:05:20.9994246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.9996616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.9997700Z ^ 2025-05-07T20:05:20.9998047Z 2025-05-07T20:05:24.1535256Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 
2025-05-07T20:05:24.7660537Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:24.7682608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7684870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7685912Z ^ 2025-05-07T20:05:24.7686142Z 2025-05-07T20:05:24.7686521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.7687113Z 2025-05-07T20:05:24.7688656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7691252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7692394Z ^ 2025-05-07T20:05:24.7692742Z 2025-05-07T20:05:24.7694282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:05:24.7696873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7698025Z ^ 2025-05-07T20:05:24.7698259Z 2025-05-07T20:05:24.7698695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.7699373Z 2025-05-07T20:05:24.7701090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7703560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7704816Z ^ 2025-05-07T20:05:24.7705184Z 2025-05-07T20:05:24.7706885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7709435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7710532Z ^ 2025-05-07T20:05:24.7710780Z 2025-05-07T20:05:24.7711189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.7711808Z 2025-05-07T20:05:24.7713363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7715975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7717239Z ^ 2025-05-07T20:05:24.7717560Z 2025-05-07T20:05:24.7718987Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7721396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7722425Z ^ 2025-05-07T20:05:24.7722674Z 2025-05-07T20:05:24.7723065Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.7723663Z 2025-05-07T20:05:24.7725178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7727581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7728936Z ^ 2025-05-07T20:05:24.7729266Z 2025-05-07T20:05:24.7730758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7733158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7734199Z ^ 2025-05-07T20:05:24.7734420Z 2025-05-07T20:05:24.7734793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.7735391Z 2025-05-07T20:05:24.7736913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.7739311Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.7740385Z ^ 2025-05-07T20:05:24.7740706Z 2025-05-07T20:05:25.3798197Z [516/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:25.3816647Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:26.7330319Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:05:34.8702875Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:05:36.4595555Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:05:36.4618459Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4621049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4622098Z ^ 2025-05-07T20:05:36.4622329Z 2025-05-07T20:05:36.4622732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.4623370Z 2025-05-07T20:05:36.4624831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4627470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4628949Z ^ 2025-05-07T20:05:36.4629324Z 2025-05-07T20:05:36.4630977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4633696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4634968Z ^ 2025-05-07T20:05:36.4635223Z 2025-05-07T20:05:36.4635689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.4636355Z 2025-05-07T20:05:36.4638033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4640704Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4641898Z ^ 2025-05-07T20:05:36.4642266Z 2025-05-07T20:05:36.4643925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4646625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4648011Z ^ 2025-05-07T20:05:36.4648275Z 2025-05-07T20:05:36.4663776Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.4664592Z 2025-05-07T20:05:36.4666049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4668877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4670043Z ^ 2025-05-07T20:05:36.4670406Z 2025-05-07T20:05:36.4672035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4674835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4675975Z ^ 2025-05-07T20:05:36.4676238Z 2025-05-07T20:05:36.4676669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.4677326Z 2025-05-07T20:05:36.4678953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4681714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4682885Z ^ 2025-05-07T20:05:36.4683239Z 2025-05-07T20:05:36.4684852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4687445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4688599Z ^ 2025-05-07T20:05:36.4688849Z 2025-05-07T20:05:36.4689256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:36.4689895Z 2025-05-07T20:05:36.4691516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:36.4693832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:36.4694983Z ^ 2025-05-07T20:05:36.4695359Z 2025-05-07T20:05:44.1061707Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:05:48.2333658Z [521/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:05:48.2356814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2359433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2360657Z ^ 2025-05-07T20:05:48.2360905Z 2025-05-07T20:05:48.2361334Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.2362103Z 2025-05-07T20:05:48.2363809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2366441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2367653Z ^ 2025-05-07T20:05:48.2367999Z 2025-05-07T20:05:48.2369606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2372112Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2373273Z ^ 2025-05-07T20:05:48.2373519Z 2025-05-07T20:05:48.2373966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.2374636Z 2025-05-07T20:05:48.2376230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2378787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2379937Z ^ 2025-05-07T20:05:48.2380288Z 2025-05-07T20:05:48.2381873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2384255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2385400Z ^ 2025-05-07T20:05:48.2385638Z 2025-05-07T20:05:48.2386057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.2386701Z 2025-05-07T20:05:48.2388290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2390669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2391520Z ^ 2025-05-07T20:05:48.2391822Z 2025-05-07T20:05:48.2393146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2395625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2396685Z ^ 2025-05-07T20:05:48.2396920Z 2025-05-07T20:05:48.2397347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.2397940Z 2025-05-07T20:05:48.2399633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2402032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2403210Z ^ 2025-05-07T20:05:48.2403561Z 2025-05-07T20:05:48.2405166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2407675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2408730Z ^ 2025-05-07T20:05:48.2408987Z 2025-05-07T20:05:48.2409406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:48.2410031Z 2025-05-07T20:05:48.2411580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:48.2414090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:48.2415323Z ^ 
2025-05-07T20:05:48.2415656Z 2025-05-07T20:05:51.1018143Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:05:51.1030273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1031899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1032538Z ^ 2025-05-07T20:05:51.1032680Z 2025-05-07T20:05:51.1032925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.1033280Z 2025-05-07T20:05:51.1034255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1035635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1036279Z ^ 2025-05-07T20:05:51.1036476Z 2025-05-07T20:05:51.1037338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:51.1038790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1039407Z ^ 2025-05-07T20:05:51.1039548Z 2025-05-07T20:05:51.1039800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.1040149Z 2025-05-07T20:05:51.1041011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1042399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1043021Z ^ 2025-05-07T20:05:51.1043236Z 2025-05-07T20:05:51.1044081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1045453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1046060Z ^ 2025-05-07T20:05:51.1046208Z 2025-05-07T20:05:51.1046446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.1046792Z 2025-05-07T20:05:51.1047657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1049033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1049665Z ^ 2025-05-07T20:05:51.1049857Z 2025-05-07T20:05:51.1050712Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1052072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1052776Z ^ 2025-05-07T20:05:51.1052915Z 2025-05-07T20:05:51.1053152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.1053518Z 2025-05-07T20:05:51.1054367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1055875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1056490Z ^ 2025-05-07T20:05:51.1056701Z 2025-05-07T20:05:51.1057548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1058923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1059533Z ^ 2025-05-07T20:05:51.1059688Z 2025-05-07T20:05:51.1059930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.1060279Z 2025-05-07T20:05:51.1061136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.1062578Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.1063213Z ^ 2025-05-07T20:05:51.1063404Z 2025-05-07T20:05:51.2533251Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:05:57.3360868Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:57.3382542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3384968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3386086Z ^ 2025-05-07T20:05:57.3386360Z 2025-05-07T20:05:57.3386752Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:57.3387360Z 2025-05-07T20:05:57.3388910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3391536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3392589Z ^ 2025-05-07T20:05:57.3392941Z 2025-05-07T20:05:57.3394770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3397810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3398981Z ^ 2025-05-07T20:05:57.3399233Z 2025-05-07T20:05:57.3399676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:57.3400358Z 2025-05-07T20:05:57.3402063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3404787Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3405987Z ^ 2025-05-07T20:05:57.3406365Z 2025-05-07T20:05:57.3408059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3410870Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3412059Z ^ 2025-05-07T20:05:57.3412310Z 2025-05-07T20:05:57.3412779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:57.3413456Z 2025-05-07T20:05:57.3415173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3417911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3419115Z ^ 2025-05-07T20:05:57.3419443Z 2025-05-07T20:05:57.3421059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3423685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3424831Z ^ 2025-05-07T20:05:57.3425072Z 2025-05-07T20:05:57.3425519Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:57.3426193Z 2025-05-07T20:05:57.3427840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3430823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3431992Z ^ 2025-05-07T20:05:57.3432345Z 2025-05-07T20:05:57.3434086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3436878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3438008Z ^ 2025-05-07T20:05:57.3438238Z 2025-05-07T20:05:57.3438676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:57.3439256Z 2025-05-07T20:05:57.3440769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:57.3443473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:57.3444609Z ^ 2025-05-07T20:05:57.3444944Z 2025-05-07T20:05:57.5294733Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:05:58.0638365Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:05:58.4317977Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:05:59.1611991Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:00.1942979Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:00.5794143Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:00.5995728Z [531/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:01.0761683Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:01.1035695Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:01.2632825Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:01.2753076Z [535/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:06:01.2755355Z ################################################################################ 2025-05-07T20:06:01.2755966Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.2756859Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:06:01.2757869Z Removing all RPATHs ... 2025-05-07T20:06:01.2758361Z ################################################################################ 2025-05-07T20:06:01.2877913Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 1 2025-05-07T20:06:01.2880001Z ################################################################################ 2025-05-07T20:06:01.2880597Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.2881455Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:06:01.2882255Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:01.2882899Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:01.2883561Z ################################################################################ 2025-05-07T20:06:01.5551912Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:01.5554485Z ################################################################################ 2025-05-07T20:06:01.5555127Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:01.5556112Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:01.5557166Z Removing all RPATHs ... 2025-05-07T20:06:01.5557644Z ################################################################################ 2025-05-07T20:06:01.9514722Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:01.9534206Z In file included from tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:1: 2025-05-07T20:06:01.9536510Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9545941Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const 
_ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:01.9556967Z ^ 2025-05-07T20:06:01.9558972Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9562035Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9564864Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9573139Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, 
_ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:01.9580848Z ^ 2025-05-07T20:06:01.9583120Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9585774Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9588244Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9591214Z /tmp/tmpxft_0000427d_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:01.9592989Z 8 warnings generated. 2025-05-07T20:06:01.9594939Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:01.9596971Z ################################################################################ 2025-05-07T20:06:01.9597563Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.9598447Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:01.9599477Z Removing all RPATHs ... 
2025-05-07T20:06:01.9599934Z ################################################################################ 2025-05-07T20:06:01.9615792Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:01.9617810Z ################################################################################ 2025-05-07T20:06:01.9618383Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.9619429Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:01.9620460Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:01.9621140Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:01.9621810Z ################################################################################ 2025-05-07T20:06:01.9709719Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:01.9712242Z ################################################################################ 2025-05-07T20:06:01.9712854Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.9714162Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:01.9715239Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:01.9715856Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:01.9716546Z ################################################################################ 2025-05-07T20:06:01.9811278Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:01.9815974Z ################################################################################ 2025-05-07T20:06:01.9816600Z [CMAKE] Running post-build script ... 2025-05-07T20:06:01.9817650Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:01.9818719Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:01.9819383Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:01.9820174Z ################################################################################ 2025-05-07T20:06:02.1501822Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:02.1504107Z ################################################################################ 2025-05-07T20:06:02.1504732Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.1505717Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:02.1506589Z Removing all RPATHs ... 
2025-05-07T20:06:02.1507010Z ################################################################################ 2025-05-07T20:06:02.2509890Z [544/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:02.2590260Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:02.2592095Z ################################################################################ 2025-05-07T20:06:02.2592619Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.2593511Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:02.2594489Z Removing all RPATHs ... 2025-05-07T20:06:02.2594897Z ################################################################################ 2025-05-07T20:06:02.2680846Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:02.2683006Z ################################################################################ 2025-05-07T20:06:02.2683679Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:02.2684799Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:02.2685982Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.2686677Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.2687366Z ################################################################################ 2025-05-07T20:06:02.2705638Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:02.2708140Z ################################################################################ 2025-05-07T20:06:02.2708753Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.2710072Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:02.2711368Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:02.2712027Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.2712734Z ################################################################################ 2025-05-07T20:06:02.4797474Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:02.4799861Z ################################################################################ 2025-05-07T20:06:02.4800464Z [CMAKE] Running post-build script ... 2025-05-07T20:06:02.4801512Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:02.4802532Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:02.4803125Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:02.4804093Z ################################################################################ 2025-05-07T20:06:06.9328647Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:06.9950515Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:07.5148390Z [551/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared 
-Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:07.5226057Z [552/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:07.5227385Z ################################################################################ 2025-05-07T20:06:07.5227766Z [CMAKE] Running post-build script ... 2025-05-07T20:06:07.5228928Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:07.5229575Z Removing all RPATHs ... 2025-05-07T20:06:07.5229871Z ################################################################################ 2025-05-07T20:06:07.5406539Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:08.0975042Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:08.3025817Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:09.4389067Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:12.0546659Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:12.4699298Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:12.4722960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4725732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4726934Z ^ 2025-05-07T20:06:12.4727193Z 2025-05-07T20:06:12.4727672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:12.4728587Z 2025-05-07T20:06:12.4730142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4732965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4734156Z ^ 2025-05-07T20:06:12.4734528Z 2025-05-07T20:06:12.4736248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4738972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4740170Z ^ 2025-05-07T20:06:12.4740430Z 
2025-05-07T20:06:12.4740881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:12.4741577Z 2025-05-07T20:06:12.4743264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4745962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4747072Z ^ 2025-05-07T20:06:12.4747411Z 2025-05-07T20:06:12.4749004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4751523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4752714Z ^ 2025-05-07T20:06:12.4752977Z 2025-05-07T20:06:12.4753472Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:12.4754286Z 2025-05-07T20:06:12.4755893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4758813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4760025Z ^ 2025-05-07T20:06:12.4760385Z 2025-05-07T20:06:12.4761913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4764795Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4765990Z ^ 2025-05-07T20:06:12.4766248Z 2025-05-07T20:06:12.4766697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:12.4767342Z 2025-05-07T20:06:12.4768918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4771512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4772705Z ^ 2025-05-07T20:06:12.4773068Z 2025-05-07T20:06:12.4774770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4777506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4778698Z ^ 2025-05-07T20:06:12.4779004Z 2025-05-07T20:06:12.4779420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:12.4780077Z 2025-05-07T20:06:12.4781648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:12.4784309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:12.4785472Z ^ 2025-05-07T20:06:12.4785861Z 2025-05-07T20:06:13.8493423Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:13.8517540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8520307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8521742Z ^ 2025-05-07T20:06:13.8521997Z 2025-05-07T20:06:13.8522457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.8523095Z 2025-05-07T20:06:13.8524754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8527407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8528844Z ^ 2025-05-07T20:06:13.8529213Z 2025-05-07T20:06:13.8530868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:06:13.8533561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8534761Z ^ 2025-05-07T20:06:13.8535045Z 2025-05-07T20:06:13.8535498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.8536162Z 2025-05-07T20:06:13.8537786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8540403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8541606Z ^ 2025-05-07T20:06:13.8541955Z 2025-05-07T20:06:13.8543647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8546352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8547513Z ^ 2025-05-07T20:06:13.8547754Z 2025-05-07T20:06:13.8548215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.8548913Z 2025-05-07T20:06:13.8550757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8553499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8554904Z ^ 2025-05-07T20:06:13.8555431Z 2025-05-07T20:06:13.8557085Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8559619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8560816Z ^ 2025-05-07T20:06:13.8561090Z 2025-05-07T20:06:13.8561520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.8562199Z 2025-05-07T20:06:13.8563812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8566543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8567907Z ^ 2025-05-07T20:06:13.8568289Z 2025-05-07T20:06:13.8570000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8572754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8573969Z ^ 2025-05-07T20:06:13.8574228Z 2025-05-07T20:06:13.8574692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.8575397Z 2025-05-07T20:06:13.8577049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:13.8579622Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:13.8580853Z ^ 2025-05-07T20:06:13.8581255Z 2025-05-07T20:06:14.9241175Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:15.0518305Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:15.0530564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0532046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0532698Z ^ 2025-05-07T20:06:15.0532845Z 2025-05-07T20:06:15.0533167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.0533555Z 2025-05-07T20:06:15.0534489Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0535910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0536550Z ^ 2025-05-07T20:06:15.0536779Z 2025-05-07T20:06:15.0537633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0539036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0539665Z ^ 2025-05-07T20:06:15.0539861Z 2025-05-07T20:06:15.0540138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.0540492Z 2025-05-07T20:06:15.0541352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0542767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0543431Z ^ 2025-05-07T20:06:15.0543632Z 2025-05-07T20:06:15.0544482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0545885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0546527Z ^ 
2025-05-07T20:06:15.0546678Z 2025-05-07T20:06:15.0546924Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.0547278Z 2025-05-07T20:06:15.0548166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0549560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0550220Z ^ 2025-05-07T20:06:15.0550420Z 2025-05-07T20:06:15.0551299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0552683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0553324Z ^ 2025-05-07T20:06:15.0553466Z 2025-05-07T20:06:15.0553732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.0554187Z 2025-05-07T20:06:15.0555101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0556521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0557189Z ^ 2025-05-07T20:06:15.0557417Z 2025-05-07T20:06:15.0558300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:06:15.0559711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0560338Z ^ 2025-05-07T20:06:15.0560506Z 2025-05-07T20:06:15.0560751Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.0561110Z 2025-05-07T20:06:15.0561992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.0563390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.0564090Z ^ 2025-05-07T20:06:15.0564292Z 2025-05-07T20:06:15.5045790Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:15.5066095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5067585Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:15.5068309Z ^ 2025-05-07T20:06:15.5068582Z 2025-05-07T20:06:15.5069107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.5069912Z 2025-05-07T20:06:15.5070986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5072425Z 
__attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:15.5073139Z ^ 2025-05-07T20:06:15.5073374Z 2025-05-07T20:06:15.5074411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5075888Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:15.5076601Z ^ 2025-05-07T20:06:15.5076834Z 2025-05-07T20:06:15.5077753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5079257Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:15.5080002Z ^ 2025-05-07T20:06:15.5080230Z 2025-05-07T20:06:15.5081319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5082747Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:15.5083443Z ^ 2025-05-07T20:06:15.5083702Z 2025-05-07T20:06:15.5084143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.5084796Z 2025-05-07T20:06:15.5085734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5087158Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:15.5087844Z ^ 2025-05-07T20:06:15.5088074Z 2025-05-07T20:06:15.5089002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5090535Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:15.5091233Z ^ 2025-05-07T20:06:15.5091484Z 2025-05-07T20:06:15.5092391Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5093916Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:15.5094619Z ^ 2025-05-07T20:06:15.5094889Z 2025-05-07T20:06:15.5095804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5097261Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:15.5097978Z ^ 2025-05-07T20:06:15.5098209Z 2025-05-07T20:06:15.5098673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.5099330Z 2025-05-07T20:06:15.5100253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5101695Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:15.5102317Z ^ 2025-05-07T20:06:15.5102565Z 2025-05-07T20:06:15.5103475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5105043Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:15.5105754Z ^ 2025-05-07T20:06:15.5106006Z 2025-05-07T20:06:15.5106917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5108505Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:15.5109223Z ^ 2025-05-07T20:06:15.5109460Z 2025-05-07T20:06:15.5110449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5111893Z __attribute__((global)) inline void 
_float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:15.5112586Z ^ 2025-05-07T20:06:15.5112821Z 2025-05-07T20:06:15.5113227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.5113966Z 2025-05-07T20:06:15.5114878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5116297Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:15.5116962Z ^ 2025-05-07T20:06:15.5117196Z 2025-05-07T20:06:15.5118137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5119747Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:15.5120480Z ^ 2025-05-07T20:06:15.5120718Z 2025-05-07T20:06:15.5121648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5123071Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:15.5123806Z ^ 2025-05-07T20:06:15.5124049Z 2025-05-07T20:06:15.5124969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5126478Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:15.5127204Z ^ 2025-05-07T20:06:15.5127386Z 2025-05-07T20:06:15.5127795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.5128630Z 2025-05-07T20:06:15.5129548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5131019Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:15.5131722Z ^ 2025-05-07T20:06:15.5131967Z 
2025-05-07T20:06:15.5132880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5134406Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:15.5135151Z ^ 2025-05-07T20:06:15.5135383Z 2025-05-07T20:06:15.5136316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:15.5137801Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:15.5138503Z ^ 2025-05-07T20:06:15.5138759Z 2025-05-07T20:06:15.6786553Z [563/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:15.7182112Z [564/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:15.7184561Z ################################################################################ 2025-05-07T20:06:15.7185188Z [CMAKE] Running post-build script ... 2025-05-07T20:06:15.7186205Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:15.7187285Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:15.7187932Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:15.7188607Z ################################################################################ 2025-05-07T20:06:16.9049481Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:17.1908035Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:17.1919401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1920302Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:17.1920874Z ^ 
2025-05-07T20:06:17.1921043Z 2025-05-07T20:06:17.1921364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.1921729Z 2025-05-07T20:06:17.1922285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1923124Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:17.1923539Z ^ 2025-05-07T20:06:17.1923731Z 2025-05-07T20:06:17.1924291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1925136Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:17.1925570Z ^ 2025-05-07T20:06:17.1925738Z 2025-05-07T20:06:17.1926296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1927240Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:17.1927691Z ^ 2025-05-07T20:06:17.1927831Z 2025-05-07T20:06:17.1928566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1929492Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:17.1929983Z ^ 2025-05-07T20:06:17.1930120Z 2025-05-07T20:06:17.1930655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1931535Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:17.1931951Z ^ 2025-05-07T20:06:17.1932108Z 2025-05-07T20:06:17.1932356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.1932715Z 
2025-05-07T20:06:17.1933271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1934101Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:17.1934519Z ^ 2025-05-07T20:06:17.1934658Z 2025-05-07T20:06:17.1935187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1936059Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:17.1936495Z ^ 2025-05-07T20:06:17.1936634Z 2025-05-07T20:06:17.1937166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1938055Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:17.1938482Z ^ 2025-05-07T20:06:17.1938640Z 2025-05-07T20:06:17.1939169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1940089Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:17.1940552Z ^ 2025-05-07T20:06:17.1940717Z 2025-05-07T20:06:17.1941327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1942191Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:17.1942639Z ^ 2025-05-07T20:06:17.1942772Z 2025-05-07T20:06:17.1943044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.1943463Z 2025-05-07T20:06:17.1944032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier 
ignored for "__global__" function 2025-05-07T20:06:17.1944874Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:17.1945248Z ^ 2025-05-07T20:06:17.1945406Z 2025-05-07T20:06:17.1945931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1946789Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:17.1947193Z ^ 2025-05-07T20:06:17.1947355Z 2025-05-07T20:06:17.1947879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1948729Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:17.1949175Z ^ 2025-05-07T20:06:17.1949391Z 2025-05-07T20:06:17.1949948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1950840Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:17.1951321Z ^ 2025-05-07T20:06:17.1951456Z 2025-05-07T20:06:17.1951976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1952860Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:17.1953301Z ^ 2025-05-07T20:06:17.1953435Z 2025-05-07T20:06:17.1953681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.1954165Z 2025-05-07T20:06:17.1954691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1955514Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 
2025-05-07T20:06:17.1955928Z ^ 2025-05-07T20:06:17.1956062Z 2025-05-07T20:06:17.1956610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1957454Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:17.1957897Z ^ 2025-05-07T20:06:17.1958032Z 2025-05-07T20:06:17.1958564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1959443Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:17.1959889Z ^ 2025-05-07T20:06:17.1960022Z 2025-05-07T20:06:17.1960552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1961462Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:17.1961922Z ^ 2025-05-07T20:06:17.1962078Z 2025-05-07T20:06:17.1962600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1963517Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:17.1963936Z ^ 2025-05-07T20:06:17.1964069Z 2025-05-07T20:06:17.1964334Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:17.1964686Z 2025-05-07T20:06:17.1965207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1966088Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:17.1966529Z ^ 2025-05-07T20:06:17.1966664Z 2025-05-07T20:06:17.1967186Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1968049Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:17.1968461Z ^ 2025-05-07T20:06:17.1968619Z 2025-05-07T20:06:17.1969144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1970014Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:17.1970431Z ^ 2025-05-07T20:06:17.1970567Z 2025-05-07T20:06:17.1971112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:17.1972046Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:17.1972524Z ^ 2025-05-07T20:06:17.1972658Z 2025-05-07T20:06:23.1501632Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:24.0263105Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:24.0274425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0275340Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:24.0275828Z ^ 2025-05-07T20:06:24.0275997Z 2025-05-07T20:06:24.0276247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:24.0276606Z 2025-05-07T20:06:24.0277168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0278039Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:24.0278478Z ^ 2025-05-07T20:06:24.0278616Z 2025-05-07T20:06:24.0279136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0280017Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:24.0280454Z ^ 2025-05-07T20:06:24.0280588Z 2025-05-07T20:06:24.0280827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:24.0281203Z 2025-05-07T20:06:24.0281726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0282645Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:24.0283059Z ^ 2025-05-07T20:06:24.0283195Z 2025-05-07T20:06:24.0283740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0284629Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 
2025-05-07T20:06:24.0285071Z ^ 2025-05-07T20:06:24.0285210Z 2025-05-07T20:06:24.0285484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:24.0285860Z 2025-05-07T20:06:24.0286383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0287250Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:24.0287667Z ^ 2025-05-07T20:06:24.0287825Z 2025-05-07T20:06:24.0288347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0289217Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:24.0289646Z ^ 2025-05-07T20:06:24.0289779Z 2025-05-07T20:06:24.0290043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:24.0290424Z 2025-05-07T20:06:24.0290948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0291826Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:24.0292244Z ^ 2025-05-07T20:06:24.0292400Z 2025-05-07T20:06:24.0292919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:24.0293792Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:24.0294209Z ^ 2025-05-07T20:06:24.0294361Z 2025-05-07T20:06:24.0294599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:24.0294951Z 2025-05-07T20:06:24.0295497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:24.0296347Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:24.0296786Z ^ 2025-05-07T20:06:24.0296921Z 2025-05-07T20:06:28.5006775Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:28.5018131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5019019Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:28.5019450Z ^ 2025-05-07T20:06:28.5019614Z 2025-05-07T20:06:28.5019934Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.5020293Z 2025-05-07T20:06:28.5020847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5021665Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:28.5022073Z ^ 2025-05-07T20:06:28.5022212Z 2025-05-07T20:06:28.5022778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5023633Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 
2025-05-07T20:06:28.5024062Z ^ 2025-05-07T20:06:28.5024198Z 2025-05-07T20:06:28.5024726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5025626Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:28.5026058Z ^ 2025-05-07T20:06:28.5026218Z 2025-05-07T20:06:28.5026745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5027637Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:28.5028068Z ^ 2025-05-07T20:06:28.5028225Z 2025-05-07T20:06:28.5028962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5029825Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:28.5030280Z ^ 2025-05-07T20:06:28.5030416Z 2025-05-07T20:06:28.5030689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.5031045Z 2025-05-07T20:06:28.5031650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5032464Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:28.5032868Z ^ 2025-05-07T20:06:28.5033007Z 2025-05-07T20:06:28.5033540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5034605Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:28.5035040Z ^ 2025-05-07T20:06:28.5035183Z 2025-05-07T20:06:28.5035717Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5036675Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:28.5037108Z ^ 2025-05-07T20:06:28.5037303Z 2025-05-07T20:06:28.5037830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5038716Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:28.5039146Z ^ 2025-05-07T20:06:28.5039279Z 2025-05-07T20:06:28.5039827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5040674Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:28.5041115Z ^ 2025-05-07T20:06:28.5041247Z 2025-05-07T20:06:28.5041509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.5041862Z 2025-05-07T20:06:28.5042384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5043294Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:28.5043665Z ^ 2025-05-07T20:06:28.5043824Z 2025-05-07T20:06:28.5044354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5045218Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:28.5045617Z ^ 2025-05-07T20:06:28.5045751Z 2025-05-07T20:06:28.5046303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:28.5047171Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:28.5047623Z ^ 2025-05-07T20:06:28.5047758Z 2025-05-07T20:06:28.5048312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5049183Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:28.5049632Z ^ 2025-05-07T20:06:28.5049768Z 2025-05-07T20:06:28.5050290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5051173Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:28.5051595Z ^ 2025-05-07T20:06:28.5051755Z 2025-05-07T20:06:28.5051997Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.5052347Z 2025-05-07T20:06:28.5052896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5053710Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:28.5054111Z ^ 2025-05-07T20:06:28.5054246Z 2025-05-07T20:06:28.5054778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5055640Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:28.5056060Z ^ 2025-05-07T20:06:28.5056192Z 2025-05-07T20:06:28.5056757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5057650Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:28.5058106Z ^ 
2025-05-07T20:06:28.5058264Z 2025-05-07T20:06:28.5058822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5059725Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:28.5060153Z ^ 2025-05-07T20:06:28.5060310Z 2025-05-07T20:06:28.5060834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5061687Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:28.5062127Z ^ 2025-05-07T20:06:28.5062264Z 2025-05-07T20:06:28.5062528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:28.5062879Z 2025-05-07T20:06:28.5063398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5064237Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:28.5064644Z ^ 2025-05-07T20:06:28.5064798Z 2025-05-07T20:06:28.5065318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5066190Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:28.5066591Z ^ 2025-05-07T20:06:28.5066754Z 2025-05-07T20:06:28.5067280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5068175Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:28.5068610Z ^ 2025-05-07T20:06:28.5068751Z 2025-05-07T20:06:28.5069297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning 
#20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:28.5070177Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:28.5070636Z ^ 2025-05-07T20:06:28.5070780Z 2025-05-07T20:06:29.3255356Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:06:29.3910505Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:06:29.3930141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.3931646Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.3932454Z ^ 2025-05-07T20:06:29.3932679Z 2025-05-07T20:06:29.3933069Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.3933587Z 2025-05-07T20:06:29.3934614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for 
"__global__" function 2025-05-07T20:06:29.3936007Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.3936751Z ^ 2025-05-07T20:06:29.3936967Z 2025-05-07T20:06:29.3937394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.3938222Z 2025-05-07T20:06:29.3939148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.3940593Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.3941327Z ^ 2025-05-07T20:06:29.3941586Z 2025-05-07T20:06:29.3942060Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.3942649Z 2025-05-07T20:06:29.3943468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.3944853Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.3945617Z ^ 2025-05-07T20:06:29.3945854Z 2025-05-07T20:06:29.3946303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.3946904Z 2025-05-07T20:06:29.3947694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.3949400Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.3950164Z ^ 2025-05-07T20:06:29.3950421Z 2025-05-07T20:06:29.3950842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.3951441Z 2025-05-07T20:06:31.1374528Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:06:37.1787943Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:06:40.8197934Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:06:40.8209273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8210663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8211341Z ^ 2025-05-07T20:06:40.8211496Z 2025-05-07T20:06:40.8211737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.8212091Z 2025-05-07T20:06:40.8212946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:06:40.8214333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8214965Z ^ 2025-05-07T20:06:40.8215163Z 2025-05-07T20:06:40.8216007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8217383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8217996Z ^ 2025-05-07T20:06:40.8218133Z 2025-05-07T20:06:40.8218369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.8218728Z 2025-05-07T20:06:40.8219583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8220996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8221613Z ^ 2025-05-07T20:06:40.8221819Z 2025-05-07T20:06:40.8222663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8224034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8224635Z ^ 2025-05-07T20:06:40.8224783Z 2025-05-07T20:06:40.8225017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.8225362Z 2025-05-07T20:06:40.8226259Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8227643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8228304Z ^ 2025-05-07T20:06:40.8228723Z 2025-05-07T20:06:40.8229635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8231010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8231629Z ^ 2025-05-07T20:06:40.8231763Z 2025-05-07T20:06:40.8232002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.8232364Z 2025-05-07T20:06:40.8233218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8234680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8235352Z ^ 2025-05-07T20:06:40.8235544Z 2025-05-07T20:06:40.8236403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8237764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8238376Z ^ 
2025-05-07T20:06:40.8238514Z 2025-05-07T20:06:40.8238759Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.8239105Z 2025-05-07T20:06:40.8239954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:40.8241337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:40.8241966Z ^ 2025-05-07T20:06:40.8242157Z 2025-05-07T20:06:41.0273028Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:06:41.0284701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0286248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0286911Z ^ 2025-05-07T20:06:41.0287067Z 2025-05-07T20:06:41.0287325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:41.0287713Z 2025-05-07T20:06:41.0288587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:06:41.0289994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0290642Z ^ 2025-05-07T20:06:41.0290871Z 2025-05-07T20:06:41.0291729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0293142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0293762Z ^ 2025-05-07T20:06:41.0293930Z 2025-05-07T20:06:41.0294219Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:41.0294574Z 2025-05-07T20:06:41.0295462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0296859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0297525Z ^ 2025-05-07T20:06:41.0297731Z 2025-05-07T20:06:41.0298616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0299996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0300646Z ^ 2025-05-07T20:06:41.0300800Z 2025-05-07T20:06:41.0312415Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:41.0312949Z 2025-05-07T20:06:41.0313955Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0315467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0316110Z ^ 2025-05-07T20:06:41.0316324Z 2025-05-07T20:06:41.0317177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0318597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0319255Z ^ 2025-05-07T20:06:41.0319408Z 2025-05-07T20:06:41.0319658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:41.0320044Z 2025-05-07T20:06:41.0320910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0322361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0322997Z ^ 2025-05-07T20:06:41.0323230Z 2025-05-07T20:06:41.0324085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0325483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0326102Z ^ 
2025-05-07T20:06:41.0326251Z 2025-05-07T20:06:41.0326521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:41.0326881Z 2025-05-07T20:06:41.0327744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:41.0329367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:41.0330027Z ^ 2025-05-07T20:06:41.0330229Z 2025-05-07T20:06:44.2740073Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:06:44.2760085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2762773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2763903Z ^ 2025-05-07T20:06:44.2764136Z 2025-05-07T20:06:44.2764537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.2765179Z 2025-05-07T20:06:44.2766744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:06:44.2769302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2770409Z ^ 2025-05-07T20:06:44.2770771Z 2025-05-07T20:06:44.2772345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2774846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2775960Z ^ 2025-05-07T20:06:44.2776209Z 2025-05-07T20:06:44.2776678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.2777288Z 2025-05-07T20:06:44.2778846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2781511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2782649Z ^ 2025-05-07T20:06:44.2782999Z 2025-05-07T20:06:44.2783840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:44.2785110Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:44.2785585Z ^ 2025-05-07T20:06:44.2785873Z 2025-05-07T20:06:44.2787562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2790045Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2791254Z ^ 2025-05-07T20:06:44.2791525Z 2025-05-07T20:06:44.2792032Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.2792670Z 2025-05-07T20:06:44.2794331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2796873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2798006Z ^ 2025-05-07T20:06:44.2798335Z 2025-05-07T20:06:44.2799301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:44.2800635Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:44.2801205Z ^ 2025-05-07T20:06:44.2801459Z 2025-05-07T20:06:44.2803015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2805495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2806595Z ^ 2025-05-07T20:06:44.2806831Z 2025-05-07T20:06:44.2807283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.2807942Z 2025-05-07T20:06:44.2809409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2811897Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2813031Z ^ 2025-05-07T20:06:44.2813382Z 2025-05-07T20:06:44.2814357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:44.2815751Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:44.2816351Z ^ 2025-05-07T20:06:44.2816646Z 2025-05-07T20:06:44.2818207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2820681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2821840Z ^ 2025-05-07T20:06:44.2822080Z 2025-05-07T20:06:44.2822495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:44.2823090Z 2025-05-07T20:06:44.2824572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:44.2827067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:44.2828537Z ^ 2025-05-07T20:06:44.2828910Z 2025-05-07T20:06:44.2829867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:44.2831264Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:44.2832014Z ^ 2025-05-07T20:06:44.2832297Z 2025-05-07T20:06:46.4356233Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:06:46.4367683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4369077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4369700Z ^ 2025-05-07T20:06:46.4369845Z 2025-05-07T20:06:46.4370087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4370458Z 2025-05-07T20:06:46.4371316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4372703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4373321Z ^ 2025-05-07T20:06:46.4373532Z 2025-05-07T20:06:46.4374461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4375841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4376515Z ^ 2025-05-07T20:06:46.4376659Z 
2025-05-07T20:06:46.4376939Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4377320Z 2025-05-07T20:06:46.4378181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4379552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4380188Z ^ 2025-05-07T20:06:46.4380383Z 2025-05-07T20:06:46.4381223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4382601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4383264Z ^ 2025-05-07T20:06:46.4383399Z 2025-05-07T20:06:46.4383634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4383992Z 2025-05-07T20:06:46.4384843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4386223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4386838Z ^ 2025-05-07T20:06:46.4387041Z 2025-05-07T20:06:46.4387884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4389269Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4389874Z ^ 2025-05-07T20:06:46.4390009Z 2025-05-07T20:06:46.4390262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4390608Z 2025-05-07T20:06:46.4391468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4392857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4393493Z ^ 2025-05-07T20:06:46.4393686Z 2025-05-07T20:06:46.4394629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4396005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4396625Z ^ 2025-05-07T20:06:46.4396762Z 2025-05-07T20:06:46.4397083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4397431Z 2025-05-07T20:06:46.4398304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:46.4399705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:46.4400367Z ^ 2025-05-07T20:06:46.4400559Z 2025-05-07T20:06:48.8544103Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:06:48.8555904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8557297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8557919Z ^ 2025-05-07T20:06:48.8558061Z 2025-05-07T20:06:48.8558324Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8558681Z 2025-05-07T20:06:48.8559548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8562552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8563197Z ^ 2025-05-07T20:06:48.8563389Z 2025-05-07T20:06:48.8564249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8565765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8566388Z ^ 2025-05-07T20:06:48.8566524Z 
2025-05-07T20:06:48.8566761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8567127Z 2025-05-07T20:06:48.8567977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8569356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8569972Z ^ 2025-05-07T20:06:48.8570183Z 2025-05-07T20:06:48.8571030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8572434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8573036Z ^ 2025-05-07T20:06:48.8573193Z 2025-05-07T20:06:48.8573428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8573774Z 2025-05-07T20:06:48.8574623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8576003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8576636Z ^ 2025-05-07T20:06:48.8576827Z 2025-05-07T20:06:48.8577667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8579032Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8579647Z ^ 2025-05-07T20:06:48.8579782Z 2025-05-07T20:06:48.8580018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8580376Z 2025-05-07T20:06:48.8581224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8582610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8583227Z ^ 2025-05-07T20:06:48.8583429Z 2025-05-07T20:06:48.8584268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8585637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8586271Z ^ 2025-05-07T20:06:48.8586406Z 2025-05-07T20:06:48.8586652Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:48.8586998Z 2025-05-07T20:06:48.8587852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:48.8589293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:48.8589920Z ^ 2025-05-07T20:06:48.8590110Z 2025-05-07T20:06:57.0752567Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:06:57.0764077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0765532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0766142Z ^ 2025-05-07T20:06:57.0766300Z 2025-05-07T20:06:57.0766544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0766894Z 2025-05-07T20:06:57.0767768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0769241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0769889Z ^ 2025-05-07T20:06:57.0770083Z 2025-05-07T20:06:57.0770945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0772503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0773133Z ^ 2025-05-07T20:06:57.0773270Z 2025-05-07T20:06:57.0773507Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:06:57.0773867Z 2025-05-07T20:06:57.0774722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0776111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0776727Z ^ 2025-05-07T20:06:57.0776943Z 2025-05-07T20:06:57.0777797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0780607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0781228Z ^ 2025-05-07T20:06:57.0781365Z 2025-05-07T20:06:57.0781619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0781969Z 2025-05-07T20:06:57.0782826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0784206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0784837Z ^ 2025-05-07T20:06:57.0785031Z 2025-05-07T20:06:57.0785872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0787243Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0787846Z ^ 2025-05-07T20:06:57.0787998Z 2025-05-07T20:06:57.0788238Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0788584Z 2025-05-07T20:06:57.0789448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0790820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0791449Z ^ 2025-05-07T20:06:57.0791642Z 2025-05-07T20:06:57.0792497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0794005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0794630Z ^ 2025-05-07T20:06:57.0794768Z 2025-05-07T20:06:57.0795003Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:57.0795396Z 2025-05-07T20:06:57.0796285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:57.0797666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:57.0798277Z ^ 2025-05-07T20:06:57.0798483Z 2025-05-07T20:06:58.8501502Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:06:58.8513315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8514819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8515441Z ^ 2025-05-07T20:06:58.8515580Z 2025-05-07T20:06:58.8515821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.8516182Z 2025-05-07T20:06:58.8517150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8518540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8519222Z ^ 2025-05-07T20:06:58.8519415Z 2025-05-07T20:06:58.8520336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8521699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8522311Z ^ 2025-05-07T20:06:58.8522448Z 2025-05-07T20:06:58.8522698Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:06:58.8523048Z 2025-05-07T20:06:58.8523899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8525283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8525947Z ^ 2025-05-07T20:06:58.8526140Z 2025-05-07T20:06:58.8526980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8528520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8529149Z ^ 2025-05-07T20:06:58.8529286Z 2025-05-07T20:06:58.8529521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.8529870Z 2025-05-07T20:06:58.8530743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8532118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8532753Z ^ 2025-05-07T20:06:58.8532944Z 2025-05-07T20:06:58.8533797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8535155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8535770Z ^ 2025-05-07T20:06:58.8535904Z 2025-05-07T20:06:58.8536150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.8536497Z 2025-05-07T20:06:58.8537349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8538730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8539339Z ^ 2025-05-07T20:06:58.8539543Z 2025-05-07T20:06:58.8540453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8541825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8542425Z ^ 2025-05-07T20:06:58.8542619Z 2025-05-07T20:06:58.8542852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.8543200Z 2025-05-07T20:06:58.8544105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.8545466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.8546088Z ^ 2025-05-07T20:06:58.8546279Z 2025-05-07T20:07:00.5333258Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:00.5344732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5346131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5346783Z ^ 2025-05-07T20:07:00.5346934Z 2025-05-07T20:07:00.5347209Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.5347567Z 2025-05-07T20:07:00.5348541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5349972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5350713Z ^ 2025-05-07T20:07:00.5350916Z 2025-05-07T20:07:00.5351840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5353242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5353953Z ^ 2025-05-07T20:07:00.5354127Z 2025-05-07T20:07:00.5354385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.5354746Z 
2025-05-07T20:07:00.5355631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5357082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5357759Z ^ 2025-05-07T20:07:00.5357965Z 2025-05-07T20:07:00.5358840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5360226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5360871Z ^ 2025-05-07T20:07:00.5361017Z 2025-05-07T20:07:00.5361289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.5361649Z 2025-05-07T20:07:00.5362525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5363940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5364590Z ^ 2025-05-07T20:07:00.5364789Z 2025-05-07T20:07:00.5365647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5367045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:00.5367687Z ^ 2025-05-07T20:07:00.5367828Z 2025-05-07T20:07:00.5368074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.5368424Z 2025-05-07T20:07:00.5369308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5370690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5371340Z ^ 2025-05-07T20:07:00.5371537Z 2025-05-07T20:07:00.5372977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5374361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5375054Z ^ 2025-05-07T20:07:00.5375200Z 2025-05-07T20:07:00.5375499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:00.5375855Z 2025-05-07T20:07:00.5376715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:00.5378132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:00.5378767Z ^ 2025-05-07T20:07:00.5378990Z 2025-05-07T20:07:02.3176359Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:02.3196143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3198605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3199663Z ^ 2025-05-07T20:07:02.3199931Z 2025-05-07T20:07:02.3200555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.3201172Z 2025-05-07T20:07:02.3202710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3205398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3206553Z ^ 2025-05-07T20:07:02.3206879Z 2025-05-07T20:07:02.3208369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3210902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3212058Z ^ 2025-05-07T20:07:02.3212311Z 2025-05-07T20:07:02.3212741Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.3213404Z 2025-05-07T20:07:02.3214898Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3217497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3218571Z ^ 2025-05-07T20:07:02.3218927Z 2025-05-07T20:07:02.3220492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3223145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3224228Z ^ 2025-05-07T20:07:02.3224481Z 2025-05-07T20:07:02.3224874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.3225497Z 2025-05-07T20:07:02.3227085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3229883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3231081Z ^ 2025-05-07T20:07:02.3231456Z 2025-05-07T20:07:02.3233084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3235819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3237024Z ^ 
2025-05-07T20:07:02.3237281Z 2025-05-07T20:07:02.3237719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.3238425Z 2025-05-07T20:07:02.3240069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3242763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3244102Z ^ 2025-05-07T20:07:02.3244473Z 2025-05-07T20:07:02.3246117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3248892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3250159Z ^ 2025-05-07T20:07:02.3250423Z 2025-05-07T20:07:02.3250896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.3251505Z 2025-05-07T20:07:02.3252901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.3255333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.3256386Z ^ 2025-05-07T20:07:02.3256713Z 2025-05-07T20:07:03.2624839Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:03.2646000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2648351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2649637Z ^ 2025-05-07T20:07:03.2649886Z 2025-05-07T20:07:03.2650309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2650975Z 2025-05-07T20:07:03.2652467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2655170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2656265Z ^ 2025-05-07T20:07:03.2656601Z 2025-05-07T20:07:03.2658106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2660644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2661707Z ^ 2025-05-07T20:07:03.2661938Z 2025-05-07T20:07:03.2662357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2662989Z 2025-05-07T20:07:03.2664416Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2666963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2668083Z ^ 2025-05-07T20:07:03.2668440Z 2025-05-07T20:07:03.2669930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2672385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2673526Z ^ 2025-05-07T20:07:03.2673762Z 2025-05-07T20:07:03.2674227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2674826Z 2025-05-07T20:07:03.2676394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2678849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2680050Z ^ 2025-05-07T20:07:03.2680411Z 2025-05-07T20:07:03.2681848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2684162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2685307Z ^ 
2025-05-07T20:07:03.2685551Z 2025-05-07T20:07:03.2685978Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2686615Z 2025-05-07T20:07:03.2688059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2690565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2691626Z ^ 2025-05-07T20:07:03.2691996Z 2025-05-07T20:07:03.2693438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2696071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2697142Z ^ 2025-05-07T20:07:03.2697395Z 2025-05-07T20:07:03.2697790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.2698385Z 2025-05-07T20:07:03.2699891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.2702323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.2703403Z ^ 2025-05-07T20:07:03.2703735Z 2025-05-07T20:07:09.9164178Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:09.9184719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9187379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9188454Z ^ 2025-05-07T20:07:09.9188740Z 2025-05-07T20:07:09.9189155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.9189911Z 2025-05-07T20:07:09.9191579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9194078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9195192Z ^ 2025-05-07T20:07:09.9195523Z 2025-05-07T20:07:09.9196990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9199408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9200495Z ^ 2025-05-07T20:07:09.9200755Z 2025-05-07T20:07:09.9201162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.9201901Z 2025-05-07T20:07:09.9203405Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9205757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9206790Z ^ 2025-05-07T20:07:09.9207139Z 2025-05-07T20:07:09.9208574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9210943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9211952Z ^ 2025-05-07T20:07:09.9212220Z 2025-05-07T20:07:09.9212612Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.9213216Z 2025-05-07T20:07:09.9214654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9216960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9218013Z ^ 2025-05-07T20:07:09.9218354Z 2025-05-07T20:07:09.9219719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9222123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9223175Z ^ 
2025-05-07T20:07:09.9223429Z 2025-05-07T20:07:09.9223859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.9224485Z 2025-05-07T20:07:09.9225931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9228804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9229820Z ^ 2025-05-07T20:07:09.9230185Z 2025-05-07T20:07:09.9232066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9234458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9235567Z ^ 2025-05-07T20:07:09.9235819Z 2025-05-07T20:07:09.9236264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.9236905Z 2025-05-07T20:07:09.9238476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.9240702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.9241805Z ^ 2025-05-07T20:07:09.9242123Z 2025-05-07T20:07:10.0792755Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:10.0813416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0815848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0816910Z ^ 2025-05-07T20:07:10.0817325Z 2025-05-07T20:07:10.0817715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.0818328Z 2025-05-07T20:07:10.0819874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0822259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0823359Z ^ 2025-05-07T20:07:10.0823744Z 2025-05-07T20:07:10.0825178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0827708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0828936Z ^ 2025-05-07T20:07:10.0829328Z 2025-05-07T20:07:10.0829802Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.0830403Z 2025-05-07T20:07:10.0831826Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0834406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0835472Z ^ 2025-05-07T20:07:10.0835800Z 2025-05-07T20:07:10.0837252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0839712Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0840822Z ^ 2025-05-07T20:07:10.0841047Z 2025-05-07T20:07:10.0841443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.0842076Z 2025-05-07T20:07:10.0843527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0845919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0846956Z ^ 2025-05-07T20:07:10.0847305Z 2025-05-07T20:07:10.0848743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0851145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0852172Z ^ 
2025-05-07T20:07:10.0852405Z 2025-05-07T20:07:10.0852822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.0853413Z 2025-05-07T20:07:10.0855100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0857573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0858729Z ^ 2025-05-07T20:07:10.0859048Z 2025-05-07T20:07:10.0862256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0864816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0865882Z ^ 2025-05-07T20:07:10.0866124Z 2025-05-07T20:07:10.0866537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:10.0867131Z 2025-05-07T20:07:10.0868668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:10.0871102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:10.0872301Z ^ 2025-05-07T20:07:10.0872608Z 2025-05-07T20:07:11.2353231Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x 
cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:11.2373782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2376362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2377665Z ^ 2025-05-07T20:07:11.2377917Z 2025-05-07T20:07:11.2378350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.2379156Z 2025-05-07T20:07:11.2380701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2383267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2384427Z ^ 2025-05-07T20:07:11.2384801Z 2025-05-07T20:07:11.2386347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2388903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2390211Z ^ 2025-05-07T20:07:11.2390464Z 2025-05-07T20:07:11.2390932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.2391561Z 2025-05-07T20:07:11.2393136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2395754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2396961Z ^ 2025-05-07T20:07:11.2397305Z 2025-05-07T20:07:11.2398833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2401406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2402541Z ^ 2025-05-07T20:07:11.2402780Z 2025-05-07T20:07:11.2403187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.2403846Z 2025-05-07T20:07:11.2405305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2407794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2408876Z ^ 2025-05-07T20:07:11.2409213Z 2025-05-07T20:07:11.2410740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2413146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2414309Z ^ 2025-05-07T20:07:11.2414548Z 2025-05-07T20:07:11.2414996Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:07:11.2415641Z 2025-05-07T20:07:11.2417325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2419850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2421062Z ^ 2025-05-07T20:07:11.2421416Z 2025-05-07T20:07:11.2423026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2425530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2426654Z ^ 2025-05-07T20:07:11.2426883Z 2025-05-07T20:07:11.2427290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.2427899Z 2025-05-07T20:07:11.2429636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.2432172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.2433429Z ^ 2025-05-07T20:07:11.2433889Z 2025-05-07T20:07:11.9493448Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:11.9514318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9516912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9518194Z ^ 2025-05-07T20:07:11.9518493Z 2025-05-07T20:07:11.9519060Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9519695Z 2025-05-07T20:07:11.9521356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9524050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9525200Z ^ 2025-05-07T20:07:11.9525525Z 2025-05-07T20:07:11.9527077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9529861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9531248Z ^ 2025-05-07T20:07:11.9531492Z 2025-05-07T20:07:11.9531928Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9532601Z 2025-05-07T20:07:11.9534206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:07:11.9536916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9538107Z ^ 2025-05-07T20:07:11.9538504Z 2025-05-07T20:07:11.9540060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9542499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9543583Z ^ 2025-05-07T20:07:11.9543863Z 2025-05-07T20:07:11.9544299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9544916Z 2025-05-07T20:07:11.9546521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9549059Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9550116Z ^ 2025-05-07T20:07:11.9550464Z 2025-05-07T20:07:11.9552029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9554714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9555847Z ^ 2025-05-07T20:07:11.9556100Z 2025-05-07T20:07:11.9556694Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9557356Z 2025-05-07T20:07:11.9558973Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9561792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9563020Z ^ 2025-05-07T20:07:11.9563393Z 2025-05-07T20:07:11.9564951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9567498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9568626Z ^ 2025-05-07T20:07:11.9568865Z 2025-05-07T20:07:11.9569300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.9569928Z 2025-05-07T20:07:11.9571533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.9574242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.9575457Z ^ 2025-05-07T20:07:11.9575835Z 2025-05-07T20:07:13.1886283Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:13.1897609Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1899103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1899811Z ^ 2025-05-07T20:07:13.1899959Z 2025-05-07T20:07:13.1900204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.1900614Z 2025-05-07T20:07:13.1901484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1902899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1903539Z ^ 2025-05-07T20:07:13.1903766Z 2025-05-07T20:07:13.1904625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1906103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1906728Z ^ 2025-05-07T20:07:13.1906879Z 2025-05-07T20:07:13.1907159Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.1907519Z 2025-05-07T20:07:13.1908383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1909792Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1910450Z ^ 2025-05-07T20:07:13.1910653Z 2025-05-07T20:07:13.1911501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1912897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1913543Z ^ 2025-05-07T20:07:13.1913685Z 2025-05-07T20:07:13.1914011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.1914398Z 2025-05-07T20:07:13.1915265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1916675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1917309Z ^ 2025-05-07T20:07:13.1917509Z 2025-05-07T20:07:13.1918378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1919758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1920394Z ^ 2025-05-07T20:07:13.1920578Z 2025-05-07T20:07:13.1920850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.1921204Z 2025-05-07T20:07:13.1922066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1923538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1924188Z ^ 2025-05-07T20:07:13.1924389Z 2025-05-07T20:07:13.1925240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1926618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1927263Z ^ 2025-05-07T20:07:13.1927407Z 2025-05-07T20:07:13.1927673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.1928027Z 2025-05-07T20:07:13.1929115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.1930598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.1931263Z ^ 2025-05-07T20:07:13.1931463Z 2025-05-07T20:07:13.3182006Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:13.3193234Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3195044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3195681Z ^ 2025-05-07T20:07:13.3195861Z 2025-05-07T20:07:13.3196111Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.3196474Z 2025-05-07T20:07:13.3197369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3198774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3199435Z ^ 2025-05-07T20:07:13.3199642Z 2025-05-07T20:07:13.3200517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3201957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3202605Z ^ 2025-05-07T20:07:13.3202756Z 2025-05-07T20:07:13.3203031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.3203394Z 2025-05-07T20:07:13.3204264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3205673Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3206313Z ^ 2025-05-07T20:07:13.3206649Z 2025-05-07T20:07:13.3207484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3208843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3209439Z ^ 2025-05-07T20:07:13.3209604Z 2025-05-07T20:07:13.3209846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.3210188Z 2025-05-07T20:07:13.3211045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3212395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3213028Z ^ 2025-05-07T20:07:13.3213218Z 2025-05-07T20:07:13.3214062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3215432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3216065Z ^ 2025-05-07T20:07:13.3216202Z 2025-05-07T20:07:13.3216435Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.3216799Z 2025-05-07T20:07:13.3217701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3219071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3219685Z ^ 2025-05-07T20:07:13.3219900Z 2025-05-07T20:07:13.3220726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3222087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3222689Z ^ 2025-05-07T20:07:13.3222865Z 2025-05-07T20:07:13.3223102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.3223447Z 2025-05-07T20:07:13.3224321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.3225687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.3226321Z ^ 2025-05-07T20:07:13.3226516Z 2025-05-07T20:07:13.5965430Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:13.5976839Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5978237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5978862Z ^ 2025-05-07T20:07:13.5979005Z 2025-05-07T20:07:13.5979246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.5979595Z 2025-05-07T20:07:13.5980474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5981851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5982492Z ^ 2025-05-07T20:07:13.5982746Z 2025-05-07T20:07:13.5983612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5984976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5985599Z ^ 2025-05-07T20:07:13.5985737Z 2025-05-07T20:07:13.5985992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.5986336Z 2025-05-07T20:07:13.5987189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5988572Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5989191Z ^ 2025-05-07T20:07:13.5989398Z 2025-05-07T20:07:13.5990243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5991619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5992223Z ^ 2025-05-07T20:07:13.5992369Z 2025-05-07T20:07:13.5992601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.5992946Z 2025-05-07T20:07:13.5993887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5995270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5995895Z ^ 2025-05-07T20:07:13.5996089Z 2025-05-07T20:07:13.5996985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.5998346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.5998959Z ^ 2025-05-07T20:07:13.5999093Z 2025-05-07T20:07:13.5999359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.5999721Z 2025-05-07T20:07:13.6000603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6001978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6002589Z ^ 2025-05-07T20:07:13.6002797Z 2025-05-07T20:07:13.6003636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6005000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6005603Z ^ 2025-05-07T20:07:13.6005753Z 2025-05-07T20:07:13.6006022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.6006366Z 2025-05-07T20:07:13.6007215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.6008590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.6009225Z ^ 2025-05-07T20:07:13.6009413Z 2025-05-07T20:07:14.9584280Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:14.9596221Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9597619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9598276Z ^ 2025-05-07T20:07:14.9598426Z 2025-05-07T20:07:14.9598705Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.9599062Z 2025-05-07T20:07:14.9599930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9601341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9602049Z ^ 2025-05-07T20:07:14.9602248Z 2025-05-07T20:07:14.9603097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9604490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9605114Z ^ 2025-05-07T20:07:14.9605282Z 2025-05-07T20:07:14.9605532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.9605893Z 2025-05-07T20:07:14.9606876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9608225Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9608865Z ^ 2025-05-07T20:07:14.9609068Z 2025-05-07T20:07:14.9609917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9611261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9611891Z ^ 2025-05-07T20:07:14.9612034Z 2025-05-07T20:07:14.9612279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.9612647Z 2025-05-07T20:07:14.9613488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9614867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9615482Z ^ 2025-05-07T20:07:14.9615702Z 2025-05-07T20:07:14.9616563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9617919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9618562Z ^ 2025-05-07T20:07:14.9618723Z 2025-05-07T20:07:14.9618991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.9619338Z 2025-05-07T20:07:14.9620192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9621538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9622168Z ^ 2025-05-07T20:07:14.9622359Z 2025-05-07T20:07:14.9623182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9624542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9625196Z ^ 2025-05-07T20:07:14.9625336Z 2025-05-07T20:07:14.9625573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.9625944Z 2025-05-07T20:07:14.9626781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.9628154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.9629129Z ^ 2025-05-07T20:07:14.9629354Z 2025-05-07T20:07:15.2272843Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:15.2284221Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2285613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2286226Z ^ 2025-05-07T20:07:15.2286487Z 2025-05-07T20:07:15.2286720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2287060Z 2025-05-07T20:07:15.2287901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2289326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2289939Z ^ 2025-05-07T20:07:15.2290126Z 2025-05-07T20:07:15.2290955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2292277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2292880Z ^ 2025-05-07T20:07:15.2293011Z 2025-05-07T20:07:15.2293240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2293590Z 2025-05-07T20:07:15.2294419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2295760Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2296354Z ^ 2025-05-07T20:07:15.2296555Z 2025-05-07T20:07:15.2297377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2298707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2299300Z ^ 2025-05-07T20:07:15.2299443Z 2025-05-07T20:07:15.2299671Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2300014Z 2025-05-07T20:07:15.2300843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2302181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2302791Z ^ 2025-05-07T20:07:15.2303020Z 2025-05-07T20:07:15.2303841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2305214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2305819Z ^ 2025-05-07T20:07:15.2305980Z 2025-05-07T20:07:15.2306213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2306562Z 2025-05-07T20:07:15.2307389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2308727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2309324Z ^ 2025-05-07T20:07:15.2309507Z 2025-05-07T20:07:15.2310334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2311688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2312287Z ^ 2025-05-07T20:07:15.2312420Z 2025-05-07T20:07:15.2312659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2312994Z 2025-05-07T20:07:15.2313889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2315430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2316061Z ^ 2025-05-07T20:07:15.2316254Z 2025-05-07T20:07:15.5194099Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:15.5205705Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5207164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5207772Z ^ 2025-05-07T20:07:15.5207912Z 2025-05-07T20:07:15.5208143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5208495Z 2025-05-07T20:07:15.5209297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5210662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5211249Z ^ 2025-05-07T20:07:15.5211460Z 2025-05-07T20:07:15.5212252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5213550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5214126Z ^ 2025-05-07T20:07:15.5214283Z 2025-05-07T20:07:15.5214515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5214848Z 2025-05-07T20:07:15.5215644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5216965Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5217573Z ^ 2025-05-07T20:07:15.5217758Z 2025-05-07T20:07:15.5218564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5219834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5220440Z ^ 2025-05-07T20:07:15.5220574Z 2025-05-07T20:07:15.5220802Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5221152Z 2025-05-07T20:07:15.5221955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5223283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5223870Z ^ 2025-05-07T20:07:15.5224078Z 2025-05-07T20:07:15.5224866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5226238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5226812Z ^ 2025-05-07T20:07:15.5226977Z 2025-05-07T20:07:15.5227231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5227562Z 2025-05-07T20:07:15.5228540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5230127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5230788Z ^ 2025-05-07T20:07:15.5230992Z 2025-05-07T20:07:15.5231845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5233309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5234016Z ^ 2025-05-07T20:07:15.5234183Z 2025-05-07T20:07:15.5234431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.5234788Z 2025-05-07T20:07:15.5235674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.5237064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.5237730Z ^ 2025-05-07T20:07:15.5237935Z 2025-05-07T20:07:16.8677097Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:17.5953891Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:07:17.6605363Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:17.6606606Z ################################################################################ 2025-05-07T20:07:17.6606989Z [CMAKE] Running post-build script ... 
2025-05-07T20:07:17.6607528Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:17.6608097Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:07:17.6608482Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:17.6608918Z ################################################################################ 2025-05-07T20:08:41.3746277Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:08:41.3758847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3760335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3760958Z ^ 2025-05-07T20:08:41.3761098Z 2025-05-07T20:08:41.3761351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:41.3761705Z 2025-05-07T20:08:41.3762557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3763942Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3764572Z ^ 2025-05-07T20:08:41.3764764Z 2025-05-07T20:08:41.3765602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3767039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3767597Z ^ 2025-05-07T20:08:41.3767736Z 2025-05-07T20:08:41.3767956Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:41.3768276Z 2025-05-07T20:08:41.3769072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3770341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3770924Z ^ 2025-05-07T20:08:41.3771100Z 2025-05-07T20:08:41.3771886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3773174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3773747Z ^ 2025-05-07T20:08:41.3773871Z 2025-05-07T20:08:41.3774089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:41.3774449Z 2025-05-07T20:08:41.3775264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3776545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3777113Z ^ 2025-05-07T20:08:41.3777302Z 2025-05-07T20:08:41.3778078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3779347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3779900Z ^ 2025-05-07T20:08:41.3780041Z 2025-05-07T20:08:41.3780261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:41.3780610Z 2025-05-07T20:08:41.3781416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3782680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3783262Z ^ 2025-05-07T20:08:41.3783439Z 2025-05-07T20:08:41.3784221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3785483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3786051Z ^ 2025-05-07T20:08:41.3786188Z 2025-05-07T20:08:41.3786415Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:08:41.3786762Z 2025-05-07T20:08:41.3787554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:41.3788852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:41.3789437Z ^ 2025-05-07T20:08:41.3789642Z 2025-05-07T20:08:44.5815386Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:08:44.5838153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5839577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5840201Z ^ 2025-05-07T20:08:44.5840360Z 2025-05-07T20:08:44.5840606Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:44.5840971Z 2025-05-07T20:08:44.5841845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5843244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5843882Z ^ 2025-05-07T20:08:44.5844079Z 2025-05-07T20:08:44.5844926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5846388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5846973Z ^ 2025-05-07T20:08:44.5847104Z 2025-05-07T20:08:44.5847325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:44.5847664Z 2025-05-07T20:08:44.5848484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5849767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5850461Z ^ 2025-05-07T20:08:44.5850659Z 2025-05-07T20:08:44.5851441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5852765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5853383Z ^ 2025-05-07T20:08:44.5853529Z 2025-05-07T20:08:44.5853751Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:08:44.5854072Z 2025-05-07T20:08:44.5854862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5856144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5856728Z ^ 2025-05-07T20:08:44.5856907Z 2025-05-07T20:08:44.5857686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5859001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5859576Z ^ 2025-05-07T20:08:44.5859706Z 2025-05-07T20:08:44.5859926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:44.5860266Z 2025-05-07T20:08:44.5861061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5862345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5862920Z ^ 2025-05-07T20:08:44.5863102Z 2025-05-07T20:08:44.5863897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5865163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5865744Z ^ 2025-05-07T20:08:44.5865875Z 2025-05-07T20:08:44.5866108Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:44.5866431Z 2025-05-07T20:08:44.5867223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:44.5868507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:44.5869096Z ^ 2025-05-07T20:08:44.5869279Z 2025-05-07T20:08:55.4345621Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:08:55.4358131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4359552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4360181Z ^ 2025-05-07T20:08:55.4360463Z 
2025-05-07T20:08:55.4360706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:55.4361057Z 2025-05-07T20:08:55.4361915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4363267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4363902Z ^ 2025-05-07T20:08:55.4364098Z 2025-05-07T20:08:55.4364948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4366292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4366914Z ^ 2025-05-07T20:08:55.4367056Z 2025-05-07T20:08:55.4367431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:55.4367760Z 2025-05-07T20:08:55.4368603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4369902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4370522Z ^ 2025-05-07T20:08:55.4370731Z 2025-05-07T20:08:55.4371548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4372836Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4373413Z ^ 2025-05-07T20:08:55.4373568Z 2025-05-07T20:08:55.4373797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:55.4374123Z 2025-05-07T20:08:55.4374936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4376215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4376863Z ^ 2025-05-07T20:08:55.4377048Z 2025-05-07T20:08:55.4377833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4379120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4379713Z ^ 2025-05-07T20:08:55.4379848Z 2025-05-07T20:08:55.4380071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:55.4380416Z 2025-05-07T20:08:55.4381209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4382511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4383097Z ^ 2025-05-07T20:08:55.4383298Z 2025-05-07T20:08:55.4384082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4385370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4385944Z ^ 2025-05-07T20:08:55.4386080Z 2025-05-07T20:08:55.4386322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:55.4386652Z 2025-05-07T20:08:55.4387449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:55.4388746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:55.4389355Z ^ 2025-05-07T20:08:55.4389543Z 2025-05-07T20:08:57.0218682Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win 
-L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:08:57.6159281Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:08:57.6399548Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:08:57.6733011Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:08:57.6734350Z ################################################################################ 2025-05-07T20:08:57.6734699Z [CMAKE] Running post-build script ... 2025-05-07T20:08:57.6735340Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:08:57.6735999Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:08:57.6736379Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:57.6736801Z ################################################################################ 2025-05-07T20:08:57.7665747Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:08:57.7667282Z ################################################################################ 2025-05-07T20:08:57.7667670Z [CMAKE] Running post-build script ... 2025-05-07T20:08:57.7668296Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:08:57.7669014Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:57.7669454Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:57.7669938Z ################################################################################ 2025-05-07T20:08:57.8497326Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:08:57.8498586Z ################################################################################ 2025-05-07T20:08:57.8498969Z [CMAKE] Running post-build script ... 2025-05-07T20:08:57.8499580Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:08:57.8500300Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:08:57.8500661Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:57.8501076Z ################################################################################ 2025-05-07T20:08:57.9296208Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:08:58.2331182Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:08:58.2332504Z ################################################################################ 2025-05-07T20:08:58.2333159Z [CMAKE] Running post-build script ... 2025-05-07T20:08:58.2333786Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:08:58.2334418Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:58.2334949Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:58.2335347Z ################################################################################ 2025-05-07T20:08:58.2336253Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:08:58.2378743Z -- Install configuration: "Release" 2025-05-07T20:08:58.2380462Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:08:58.2405254Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:08:58.2406204Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:08:58.2434887Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:08:58.2435898Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:08:58.2458645Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:08:58.2476455Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:08:58.2478938Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:08:58.2482615Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:08:58.2507311Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:08:58.2508536Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:08:58.2509794Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:04.4762483Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:05.6059359Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:08.2225615Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:08.6885118Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:08.6888384Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:08.6890128Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:08.6891395Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:08.6892646Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:08.6893826Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:08.6895241Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:08.6896451Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:08.6897735Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:08.6899059Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:08.6900317Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:08.6901595Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 
2025-05-07T20:09:08.6902948Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:08.6904173Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:08.6905359Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:08.6906568Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:08.6907868Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:08.6909175Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:08.6910294Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:08.6924500Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:08.6973374Z 2025-05-07T20:09:08.7016451Z 2025-05-07T20:09:08.7017033Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:08.7017993Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:08.7019149Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:08.7019892Z copying fbgemm_gpu/metrics.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:08.7022170Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:08.7023350Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:08.7024385Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:08.7025233Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:08.7026067Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:08.7026983Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:08.7027875Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:08.7030785Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:08.7031935Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:08.7032935Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:08.7034122Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:08.7035351Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:08.7036636Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:08.7037960Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:08.7039334Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:08.7040642Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:08.7041730Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:08.7042546Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:08.7043170Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:08.7043893Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:08.7044773Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:08.7045607Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:08.7046308Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:08.7047081Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:08.7047953Z copying fbgemm_gpu/docs/examples.py 
-> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:08.7049072Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:08.7050091Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:08.7051229Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:08.7052260Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:08.7053112Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:08.7053951Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:08.7054683Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:08.7055666Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:08.7056593Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:08.7057355Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:08.7058037Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:08.7058728Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:08.7059390Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:08.7060089Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton 
2025-05-07T20:09:08.7060795Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:08.7061629Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:08.7062481Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:08.7063631Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:08.7064512Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:08.7065230Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:08.7066064Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:08.7066929Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:08.7067827Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:08.7068755Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:08.7069498Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:08.7070332Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:08.7071143Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:08.7071894Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:08.7072750Z copying fbgemm_gpu/sll/meta/meta_sll.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:08.7073554Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7074464Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:08.7075353Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:08.7076485Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:08.7077797Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:08.7078949Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:08.7080113Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:08.7081462Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:08.7082901Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:08.7084365Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:08.7085730Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:08.7087138Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:08.7088455Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:08.7089740Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:08.7090767Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7091535Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:08.7092443Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:08.7093396Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:08.7094320Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:08.7095350Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:08.7096484Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:08.7097532Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:08.7098486Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:08.7099550Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:08.7100780Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:08.7101813Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:08.7102576Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:08.7103314Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:08.7104354Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:08.7105299Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7106041Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:08.7106930Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:08.7107847Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:08.7108784Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:08.7109590Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7110365Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:08.7111299Z copying 
fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:08.7112245Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:08.7113174Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:08.7114226Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:08.7115029Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:08.7115837Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:08.7116900Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:08.7117818Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:08.7118680Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:08.7119833Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:08.7120864Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:08.7121734Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:08.7122844Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:08.7123571Z 2025-05-07T20:09:08.7207973Z INFO:root:running bdist_wheel 
2025-05-07T20:09:08.7238347Z INFO:root:running build 2025-05-07T20:09:08.7238691Z INFO:root:running build_py 2025-05-07T20:09:08.7243800Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7245699Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7247690Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7249216Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7250902Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7252672Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7254415Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7255848Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7257871Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7259327Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7260857Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7262815Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7264243Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7266108Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7267593Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7269003Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7270468Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7272009Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7273944Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7277338Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7278872Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7280377Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7281664Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7283376Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:08.7285006Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:08.7286381Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:08.7288353Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7289490Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7290941Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7292273Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7293663Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7295174Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7296663Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7298095Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7299462Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7300994Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:08.7302889Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:08.7304130Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:08.7305635Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:08.7307383Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:08.7308629Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:08.7310494Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:08.7311587Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:08.7313743Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:08.7314888Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:08.7316504Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:08.7317989Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:08.7319509Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:08.7321578Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:08.7322700Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:08.7324150Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:08.7325984Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:08.7327416Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:08.7329795Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:08.7331223Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:08.7332621Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:08.7334420Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:08.7335546Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:08.7337045Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:08.7339470Z INFO:root:creating 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7340613Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7342223Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7344273Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7345904Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7347595Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7349284Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7350911Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7352603Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7354381Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7356068Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7357747Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7359375Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7360969Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:08.7362254Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7363510Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7364983Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7366444Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7367923Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7369543Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7371105Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7372631Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7374068Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7375646Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7377197Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7378674Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:08.7379802Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:08.7380929Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:08.7382400Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:08.7384041Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7385252Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7386737Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7388145Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7389674Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:08.7392677Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7393859Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7395532Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7396970Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7398508Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7399944Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:08.7401572Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:08.7402805Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:08.7404365Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:08.7406045Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:08.7407281Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:08.7408844Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:08.7410506Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:08.7411678Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:08.7413205Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:08.7457621Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7501101Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.7838972Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:08.9046146Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.3182713Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.3185530Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.4481554Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.4612931Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.4824068Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:12.5520469Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:15.3414106Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:15.4240465Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:22.7231079Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.8521147Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:26.4679081Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:26.9331202Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:26.9716309Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.2417269Z INFO:root:creating 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2418828Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2422692Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2429044Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2439546Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2451868Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2457165Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2468808Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2475095Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2486205Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2493378Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2512216Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2514784Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2521621Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2527830Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:27.2539025Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:27.2540582Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:27.2556376Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:27.2560748Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.2600582Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8315345Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8316731Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8318070Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8319314Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8320631Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8322090Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8323464Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8324736Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8326029Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8327301Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8328938Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8330376Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8331954Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8333329Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8334702Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8336170Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8337680Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8339819Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8342766Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8344299Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8345989Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8347440Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.8349143Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:27.8350680Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:27.8352100Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8353703Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8355320Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8356787Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8358338Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8360358Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8362077Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8363708Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8365408Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:27.8366831Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:27.8370279Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:27.8371644Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:27.8373186Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:27.8374738Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:27.8376250Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:27.8377665Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:27.8379308Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:27.8380805Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:27.8382303Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:27.8383682Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:27.8385109Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:27.8386579Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:27.8388039Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:27.8390061Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:27.8391556Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:27.8393116Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8394789Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8396473Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8398093Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8399631Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8401168Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8402786Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8404493Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8406181Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8407837Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 
2025-05-07T20:09:27.8409513Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8411122Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8412731Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8414261Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8415665Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8417103Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8418509Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8420200Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8422139Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8425197Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8426733Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8428229Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8429950Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8431634Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8433019Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:27.8434663Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:27.8436100Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8437464Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8438918Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8440482Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8442756Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8444201Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8445601Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8447122Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8448890Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8450494Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:27.8452072Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 
2025-05-07T20:09:27.8453659Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:27.8455303Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:27.8456989Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:27.8458528Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:27.8473793Z INFO:skbuild:copied 90 files 2025-05-07T20:09:27.8474111Z INFO:root:running build_ext 2025-05-07T20:09:27.8474602Z INFO:root:installing to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:27.8475097Z INFO:root:running install 2025-05-07T20:09:27.8530278Z INFO:root:running install_lib 2025-05-07T20:09:27.8530803Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:27.8531957Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:27.8533059Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:27.8534297Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:27.8535865Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:27.8537031Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:27.8538135Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8539636Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8541154Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8542696Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8544326Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8545993Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8547599Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8549137Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8550994Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:27.8552133Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:27.8553393Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:27.8555085Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:27.8556247Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:27.8557000Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:27.8558161Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:27.8559691Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:27.8560895Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:27.8562057Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:27.8563604Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:27.8564779Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8565955Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8567556Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8569272Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8571096Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8572816Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8574546Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8576355Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8578249Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8580127Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8582032Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8583881Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8585692Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8587494Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:27.8589207Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:27.8590284Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:27.8591041Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8592206Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8593872Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8595508Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8597091Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8598761Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8600498Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8602148Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8603760Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8605466Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8607184Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8608891Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:27.8610074Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:27.8611252Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:27.8612894Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:27.8614141Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8614925Z INFO:root:creating 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:27.8616146Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:27.8617933Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:27.8619645Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8621172Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8622745Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8624326Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:27.8625498Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8626677Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8628255Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8629990Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8631595Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8633253Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:27.8634508Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:27.8635688Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:27.8637413Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:27.8638989Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:27.8640093Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:27.8640864Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 
2025-05-07T20:09:27.8642102Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:27.8643830Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:27.8645518Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:27.8647062Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:27.8648494Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:27.8649935Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:27.8651018Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:27.8652066Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:27.8653465Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/filestore.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:27.8654888Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:27.8656316Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:27.8657680Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:27.8658982Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:27.8669563Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:27.8802954Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1502448Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1504059Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1604866Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1619908Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1635134Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.1694322Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.3813610Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.3876991Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:28.9488675Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.0357444Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.2351409Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.2707241Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.2735302Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.2948512Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2950168Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2952572Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2954819Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2957100Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2959198Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2961329Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2963508Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2965782Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2967949Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2970153Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2972410Z 
INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2974528Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2976609Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2978741Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:29.2980304Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:29.2981959Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:29.2984150Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:29.2986016Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.2987545Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3419494Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3421086Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3422793Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3424206Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3425710Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3427345Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3429085Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_comm.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3430550Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3432030Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3433488Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3435038Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3436647Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3438271Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3439909Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3441496Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3443270Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3444958Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3446664Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3448380Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3450089Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3451971Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3453427Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:29.3454265Z INFO:skbuild:copied 125 files 2025-05-07T20:09:29.3454566Z INFO:root:running install_egg_info 2025-05-07T20:09:29.3474965Z INFO:root:running egg_info 2025-05-07T20:09:29.3499710Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:29.3500633Z INFO:root:writing dependency_links to 
fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:29.3502574Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:29.3503477Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:29.3586096Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:29.3617439Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:29.3618384Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.11.egg-info 2025-05-07T20:09:29.3622350Z INFO:root:running install_scripts 2025-05-07T20:09:29.3623008Z INFO:skbuild:copied 0 files 2025-05-07T20:09:32.0675910Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:32.0677307Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-radia1_8/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:32.0678386Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:32.0936954Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:32.0953209Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:32.0953868Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:32.2973614Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:32.3105197Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:32.3235489Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:34.0270439Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:34.2276384Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:34.9340599Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 
2025-05-07T20:09:35.0405021Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:35.6298400Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:09:53.6304477Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:09:54.8900826Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:22.5764590Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:25.3637397Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:28.9501471Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:29.6362387Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:29.8538895Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:38.3962766Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:10:49.2325306Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:10:50.6894356Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:10:50.7245294Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:10:50.7247152Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:10:50.7248737Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:10:50.7251994Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:10:50.7254730Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:10:50.7257604Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:10:50.7268176Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:10:50.7271682Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:10:50.7274401Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:10:50.7275981Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 
2025-05-07T20:10:50.7277318Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:10:50.7279062Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:10:50.7282119Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:10:50.7304506Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:10:50.7348292Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:10:50.7351014Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:10:50.7352360Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:10:50.7354299Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:10:50.7355748Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:10:50.7357282Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:10:50.7358963Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:10:50.7360615Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:10:50.7361791Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:10:50.7363460Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:10:50.7365768Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:10:50.7367386Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:10:50.7369400Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:10:50.7370898Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:10:50.7376495Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:10:50.7378260Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:10:50.7379816Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:10:50.7382201Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:10:50.7383420Z INFO:wheel:adding 
'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:10:50.7385193Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:10:50.7391232Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:10:50.7393552Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:10:50.7396066Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:10:50.7398479Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:10:50.7399773Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:10:50.7401449Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:10:50.7403948Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:10:50.7407453Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:10:50.7411266Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:10:50.7413166Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:10:50.7415247Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:10:50.7420605Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:10:50.7425772Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:10:50.7427828Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:10:50.7431765Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:10:50.7437068Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:10:50.7439551Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:10:50.7442422Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:10:50.7445846Z INFO:wheel:adding 
'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:10:50.7447834Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:10:50.7449607Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:10:50.7452410Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:10:50.7455429Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:10:50.7458240Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:10:50.7461177Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:10:50.7464089Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:10:50.7466970Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:10:50.7483366Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:10:50.7485840Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:10:50.7486632Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:10:50.7487283Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:10:50.7488026Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:10:50.7488628Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:10:50.7489007Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:10:50.7489406Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:10:50.7490095Z INFO:wheel:adding 
'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:10:50.7492349Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:10:50.7494472Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:10:50.7496240Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:10:50.7497669Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:10:50.7500671Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:10:50.7503338Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:10:50.7505559Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:10:50.7507095Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:10:50.7508594Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:10:50.7510095Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:10:50.7511493Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:10:50.7512690Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:10:50.7518532Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:10:50.7544449Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:10:50.7546819Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:10:50.7549462Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:10:50.7551050Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:10:50.7553593Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:10:50.7555431Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:10:50.7556715Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:10:50.7558291Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:10:50.7560615Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:10:50.7566021Z INFO:wheel:adding 
'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:10:50.7567997Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:10:50.7569598Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:10:50.7577006Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:10:50.7581419Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:10:50.7583150Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:10:50.7590864Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:10:50.7592976Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:10:50.7595151Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:10:50.7596584Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:10:50.7598618Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:10:50.7601061Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:10:50.7601963Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:10:50.7602818Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:10:50.7609152Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:10:50.7612171Z INFO:root:removing _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:10:50.9111876Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:10:50.9112458Z │ │ Version │ 2025-05-07T20:10:50.9113002Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:10:50.9113521Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:10:50.9114047Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:50.9114711Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:10:50.9115276Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:50.9116605Z │ 
CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:10:50.9117158Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:10:50.9117626Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:10:50.9118181Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:10:50.9118720Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:10:50.9119256Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:10:51.1596934Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:10:51.2345029Z 2025-05-07T20:10:51.2516704Z ################################################################################ 2025-05-07T20:10:51.2518136Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2519428Z [CHECK] Listing out library size: 2025-05-07T20:10:51.2520589Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2521496Z 2025-05-07T20:10:51.2529723Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2529986Z 2025-05-07T20:10:51.2530789Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2531961Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.2532494Z 2025-05-07T20:10:51.2606286Z GLIBC_2.2.5 2025-05-07T20:10:51.2606906Z GLIBC_2.14 2025-05-07T20:10:51.2607269Z 2025-05-07T20:10:51.2607492Z 2025-05-07T20:10:51.2611284Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2613222Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.2613799Z 2025-05-07T20:10:51.2678614Z GLIBCXX_3.4 
2025-05-07T20:10:51.2680878Z 2025-05-07T20:10:51.2680976Z 2025-05-07T20:10:51.2706365Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.42Z15e74BL.symbols.txt 2025-05-07T20:10:51.2707629Z 2025-05-07T20:10:51.2738692Z 2025-05-07T20:10:51.2770855Z [CHECK] Total Number of symbols: 841 2025-05-07T20:10:51.2788988Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:10:51.2815665Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.00WJ15kfWC.usymbols.txt 2025-05-07T20:10:51.2816111Z 2025-05-07T20:10:51.2833783Z 2025-05-07T20:10:51.2863717Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:10:51.2884228Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.2884953Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.2885421Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.2885780Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.2886095Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:51.2886433Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.2886747Z U abort@GLIBC_2.2.5 2025-05-07T20:10:51.2887036Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:51.2887483Z U close@GLIBC_2.2.5 2025-05-07T20:10:51.2887773Z U fputs@GLIBC_2.2.5 2025-05-07T20:10:51.2888068Z U free@GLIBC_2.2.5 2025-05-07T20:10:51.2888344Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:10:51.2888651Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:51.2888928Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:51.2889226Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:10:51.2889519Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:51.2889811Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:51.2890090Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:51.2890457Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.2890756Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.2891035Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.2891325Z U mmap@GLIBC_2.2.5 2025-05-07T20:10:51.2891607Z U mprotect@GLIBC_2.2.5 2025-05-07T20:10:51.2891913Z U munmap@GLIBC_2.2.5 2025-05-07T20:10:51.2892189Z U open64@GLIBC_2.2.5 2025-05-07T20:10:51.2892569Z U operator 
delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.2892906Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:10:51.2893250Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.2893589Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.2893899Z U read@GLIBC_2.2.5 2025-05-07T20:10:51.2894190Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:51.2894476Z U shm_open@GLIBC_2.2.5 2025-05-07T20:10:51.2894781Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:10:51.2895073Z U snprintf@GLIBC_2.2.5 2025-05-07T20:10:51.2895409Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.2895711Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:51.2896005Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:51.2896302Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.2896575Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:51.2896921Z U syscall@GLIBC_2.2.5 2025-05-07T20:10:51.2897305Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:51.2897593Z U uname@GLIBC_2.2.5 2025-05-07T20:10:51.2897875Z U unlink@GLIBC_2.2.5 2025-05-07T20:10:51.2898169Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:10:51.2898504Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.2898923Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.2899436Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.2899799Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.2900106Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.2900391Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.2900675Z w __gmon_start__ 2025-05-07T20:10:51.2900970Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.2901355Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2901588Z 2025-05-07T20:10:51.2931228Z linux-vdso.so.1 (0x00007ffc6ed12000) 2025-05-07T20:10:51.2932178Z libtorch.so => not found 2025-05-07T20:10:51.2932900Z libc10.so => not found 2025-05-07T20:10:51.2933575Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.2934326Z libc10_cuda.so => not found 2025-05-07T20:10:51.2934863Z libnccl.so.2 => not 
found 2025-05-07T20:10:51.2935121Z libcuda.so.1 => not found 2025-05-07T20:10:51.2935373Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.2935665Z libtorch_cpu.so => not found 2025-05-07T20:10:51.2935926Z libtorch_cuda.so => not found 2025-05-07T20:10:51.2936268Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fdd8d9e8000) 2025-05-07T20:10:51.2936703Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fdd8d992000) 2025-05-07T20:10:51.2937335Z librt.so.1 => /lib64/librt.so.1 (0x00007fdd8d98b000) 2025-05-07T20:10:51.2937836Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fdd8d95d000) 2025-05-07T20:10:51.2938245Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fdd8d958000) 2025-05-07T20:10:51.2938639Z libc.so.6 => /lib64/libc.so.6 (0x00007fdd8d750000) 2025-05-07T20:10:51.2938966Z libm.so.6 => /lib64/libm.so.6 (0x00007fdd8d675000) 2025-05-07T20:10:51.2939310Z /lib64/ld-linux-x86-64.so.2 (0x00007fdd8dcc9000) 2025-05-07T20:10:51.2939543Z 2025-05-07T20:10:51.2939658Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.2939998Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:10:51.2940255Z 2025-05-07T20:10:51.2971644Z 2025-05-07T20:10:51.2972419Z Dynamic section at offset 0x75898 contains 39 entries: 2025-05-07T20:10:51.2973640Z Tag Type Name/Value 2025-05-07T20:10:51.2974060Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.2974574Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.2975176Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.2975860Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.2976332Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.2976807Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.2977295Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.2977774Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cpu.so] 2025-05-07T20:10:51.2978266Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.2978733Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.2979208Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.2979669Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:10:51.2980134Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.2980658Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:51.2981121Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.2981584Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:10:51.2981957Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:10:51.2982268Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:10:51.2982564Z 0x0000000000000019 (INIT_ARRAY) 0x74ac0 2025-05-07T20:10:51.2982889Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.2983211Z 0x000000000000001a (FINI_ARRAY) 0x74ac8 2025-05-07T20:10:51.2983511Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.2983854Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.2984173Z 0x0000000000000005 (STRTAB) 0x6980 2025-05-07T20:10:51.2984467Z 0x0000000000000006 (SYMTAB) 0x1a90 2025-05-07T20:10:51.2984799Z 0x000000000000000a (STRSZ) 48829 (bytes) 2025-05-07T20:10:51.2985292Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.2985634Z 0x0000000000000003 (PLTGOT) 0x75fe8 2025-05-07T20:10:51.2985969Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:10:51.2986483Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.2986801Z 0x0000000000000017 (JMPREL) 0x162e0 2025-05-07T20:10:51.2987137Z 0x0000000000000007 (RELA) 0x12f98 2025-05-07T20:10:51.2987498Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:10:51.2987846Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.2988199Z 0x000000006ffffffe (VERNEED) 0x12ed8 
2025-05-07T20:10:51.2988528Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.2988905Z 0x000000006ffffff0 (VERSYM) 0x1283e 2025-05-07T20:10:51.2989229Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:10:51.2989551Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.2989757Z 2025-05-07T20:10:51.2989884Z ################################################################################ 2025-05-07T20:10:51.2990110Z 2025-05-07T20:10:51.2990114Z 2025-05-07T20:10:51.2990224Z ################################################################################ 2025-05-07T20:10:51.2990712Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.2991179Z [CHECK] Listing out library size: 2025-05-07T20:10:51.2991655Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.2992010Z 2025-05-07T20:10:51.2992214Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.2992500Z 2025-05-07T20:10:51.2992864Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.2993845Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.2994508Z 2025-05-07T20:10:51.3037210Z GLIBC_2.2.5 2025-05-07T20:10:51.3037870Z GLIBC_2.14 2025-05-07T20:10:51.3038623Z 2025-05-07T20:10:51.3038748Z 2025-05-07T20:10:51.3040148Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.3043056Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.3044471Z 2025-05-07T20:10:51.3093885Z GLIBCXX_3.4 2025-05-07T20:10:51.3094313Z GLIBCXX_3.4.9 2025-05-07T20:10:51.3094528Z GLIBCXX_3.4.21 
2025-05-07T20:10:51.3094669Z 2025-05-07T20:10:51.3094674Z 2025-05-07T20:10:51.3117402Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.6ZfWimAy93.symbols.txt 2025-05-07T20:10:51.3119075Z 2025-05-07T20:10:51.3146251Z 2025-05-07T20:10:51.3176591Z [CHECK] Total Number of symbols: 116 2025-05-07T20:10:51.3191595Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:51.3219282Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.jU9RL5Cer2.usymbols.txt 2025-05-07T20:10:51.3219983Z 2025-05-07T20:10:51.3233957Z 2025-05-07T20:10:51.3264341Z [CHECK] Listing out undefined symbols (55 total): 2025-05-07T20:10:51.3278653Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.3279686Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.3279990Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.3280310Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.3280624Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.3280950Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.3281277Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.3281587Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.3281900Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:51.3282214Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.3282539Z U c10::BoolType::get() 2025-05-07T20:10:51.3282829Z U c10::StringType::get() 2025-05-07T20:10:51.3283164Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.3283927Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.3285137Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.3286068Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:51.3286360Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:51.3286659Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.3286952Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.3287237Z U memset@GLIBC_2.2.5 
2025-05-07T20:10:51.3287554Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.3287898Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.3288313Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:51.3289018Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:51.3289855Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.3290702Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.3291454Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.3291831Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.3292224Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.3292609Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.3293085Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.3293536Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.3294400Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.3295151Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.3295473Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.3295840Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.3296145Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.3296460Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.3296739Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.3297010Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:51.3297307Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.3298061Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.3299182Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, 
std::vector > const&) & 2025-05-07T20:10:51.3300121Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.3300715Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:10:51.3301096Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.3301481Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.3301884Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.3302447Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.3303063Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.3303488Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.3303789Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.3304112Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.3304381Z w __gmon_start__ 2025-05-07T20:10:51.3304659Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:10:51.3305005Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.3305406Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.3305694Z 2025-05-07T20:10:51.3323494Z linux-vdso.so.1 (0x00007fff8cdcf000) 2025-05-07T20:10:51.3324393Z libtorch.so => not found 2025-05-07T20:10:51.3325103Z libc10.so => not found 2025-05-07T20:10:51.3325793Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.3326518Z libc10_cuda.so => not found 2025-05-07T20:10:51.3327341Z libnccl.so.2 => not found 2025-05-07T20:10:51.3328041Z libcuda.so.1 => not found 2025-05-07T20:10:51.3329139Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.3329900Z libtorch_cpu.so => not found 2025-05-07T20:10:51.3330659Z libtorch_cuda.so => not found 2025-05-07T20:10:51.3331619Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f20eff47000) 2025-05-07T20:10:51.3332846Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f20efef1000) 2025-05-07T20:10:51.3333556Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f20efec1000) 
2025-05-07T20:10:51.3334006Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f20efebc000) 2025-05-07T20:10:51.3334426Z libc.so.6 => /lib64/libc.so.6 (0x00007f20efcb4000) 2025-05-07T20:10:51.3334778Z libm.so.6 => /lib64/libm.so.6 (0x00007f20efbd9000) 2025-05-07T20:10:51.3335154Z /lib64/ld-linux-x86-64.so.2 (0x00007f20f01bc000) 2025-05-07T20:10:51.3335398Z 2025-05-07T20:10:51.3335508Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.3335934Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:51.3336257Z 2025-05-07T20:10:51.3358024Z 2025-05-07T20:10:51.3358714Z Dynamic section at offset 0x8c98 contains 38 entries: 2025-05-07T20:10:51.3359812Z Tag Type Name/Value 2025-05-07T20:10:51.3361067Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.3362584Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.3363840Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.3364351Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.3364839Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.3365333Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.3365827Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.3366343Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.3366847Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.3367339Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.3367839Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.3368319Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.3368823Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:51.3369311Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.3369811Z 
0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:10:51.3370238Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:10:51.3370550Z 0x000000000000000d (FINI) 0x6f80 2025-05-07T20:10:51.3370871Z 0x0000000000000019 (INIT_ARRAY) 0x9bb0 2025-05-07T20:10:51.3371193Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:10:51.3371534Z 0x000000000000001a (FINI_ARRAY) 0x9bc0 2025-05-07T20:10:51.3371852Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.3372188Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.3372604Z 0x0000000000000005 (STRTAB) 0xed0 2025-05-07T20:10:51.3372916Z 0x0000000000000006 (SYMTAB) 0x3d8 2025-05-07T20:10:51.3373257Z 0x000000000000000a (STRSZ) 7795 (bytes) 2025-05-07T20:10:51.3373596Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.3373931Z 0x0000000000000003 (PLTGOT) 0x9fe8 2025-05-07T20:10:51.3374263Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:10:51.3374771Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.3375085Z 0x0000000000000017 (JMPREL) 0x33a0 2025-05-07T20:10:51.3375449Z 0x0000000000000007 (RELA) 0x2ef0 2025-05-07T20:10:51.3375785Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:10:51.3376145Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.3376495Z 0x000000006ffffffe (VERNEED) 0x2e30 2025-05-07T20:10:51.3376819Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:51.3377156Z 0x000000006ffffff0 (VERSYM) 0x2d44 2025-05-07T20:10:51.3377474Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:10:51.3377815Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.3378020Z 2025-05-07T20:10:51.3378132Z ################################################################################ 2025-05-07T20:10:51.3378371Z 2025-05-07T20:10:51.3378375Z 2025-05-07T20:10:51.3378485Z ################################################################################ 2025-05-07T20:10:51.3378918Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 
2025-05-07T20:10:51.3379340Z [CHECK] Listing out library size: 2025-05-07T20:10:51.3379742Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.3380049Z 2025-05-07T20:10:51.3380194Z 6 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.3380446Z 2025-05-07T20:10:51.3380770Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.3381632Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.3382185Z 2025-05-07T20:10:51.3651897Z GLIBC_2.2.5 2025-05-07T20:10:51.3652269Z GLIBC_2.3 2025-05-07T20:10:51.3652475Z GLIBC_2.14 2025-05-07T20:10:51.3652603Z 2025-05-07T20:10:51.3652608Z 2025-05-07T20:10:51.3652949Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.3653966Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.3654585Z 2025-05-07T20:10:51.3920106Z GLIBCXX_3.4 2025-05-07T20:10:51.3920746Z GLIBCXX_3.4.9 2025-05-07T20:10:51.3921476Z GLIBCXX_3.4.11 2025-05-07T20:10:51.3921681Z GLIBCXX_3.4.14 2025-05-07T20:10:51.3921897Z GLIBCXX_3.4.15 2025-05-07T20:10:51.3922108Z GLIBCXX_3.4.18 2025-05-07T20:10:51.3922316Z GLIBCXX_3.4.21 2025-05-07T20:10:51.3922437Z 2025-05-07T20:10:51.3922442Z 2025-05-07T20:10:51.3940680Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.cktZK2QyNW.symbols.txt 2025-05-07T20:10:51.3941679Z 2025-05-07T20:10:51.4165701Z 2025-05-07T20:10:51.4193697Z [CHECK] Total Number of symbols: 4951 2025-05-07T20:10:51.4212045Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:10:51.4229511Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.aetBDNxuwz.usymbols.txt 2025-05-07T20:10:51.4229978Z 
2025-05-07T20:10:51.4257849Z 2025-05-07T20:10:51.4282140Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:10:51.4299065Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.4300088Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:51.4301153Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.4302059Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.4302770Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.4303097Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:51.4303541Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.4303841Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.4304146Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.4304460Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:10:51.4304792Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.4305077Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:51.4305379Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:51.4305710Z U __extendhfsf2 2025-05-07T20:10:51.4306009Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.4306327Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:51.4306616Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:51.4306902Z U __truncsfhf2 2025-05-07T20:10:51.4307144Z U abort@GLIBC_2.2.5 2025-05-07T20:10:51.4307646Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:51.4308348Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:51.4309252Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:51.4310355Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:51.4311421Z U 
asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:10:51.4312145Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:51.4312747Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:51.4313319Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:10:51.4313926Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:10:51.4314695Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:10:51.4315265Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:10:51.4316036Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:10:51.4316643Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:10:51.4317081Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:10:51.4317716Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:10:51.4318260Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:10:51.4318735Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:10:51.4319219Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:10:51.4319568Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:51.4319866Z U ceilf@GLIBC_2.2.5 2025-05-07T20:10:51.4320151Z U cpuinfo_get_packages 2025-05-07T20:10:51.4320471Z U cpuinfo_get_packages_count 2025-05-07T20:10:51.4320772Z U cpuinfo_initialize 2025-05-07T20:10:51.4321068Z U cpuinfo_isa 2025-05-07T20:10:51.4321361Z U floor@GLIBC_2.2.5 2025-05-07T20:10:51.4321644Z U fma@GLIBC_2.2.5 2025-05-07T20:10:51.4321909Z U fmaf@GLIBC_2.2.5 2025-05-07T20:10:51.4322194Z U free@GLIBC_2.2.5 
2025-05-07T20:10:51.4322486Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:51.4322764Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:51.4323054Z U ldexp@GLIBC_2.2.5 2025-05-07T20:10:51.4323330Z U log2@GLIBC_2.2.5 2025-05-07T20:10:51.4323618Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:51.4323891Z U lrintf@GLIBC_2.2.5 2025-05-07T20:10:51.4324190Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.4324492Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.4324791Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.4325087Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:10:51.4325384Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:10:51.4325713Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.4326054Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:10:51.4326414Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.4326898Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.4327234Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:10:51.4327513Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:51.4327892Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:51.4328533Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:51.4329212Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:51.4329893Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:51.4330655Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:10:51.4331689Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:10:51.4332914Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:51.4333661Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:51.4334159Z U 
std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:51.4334572Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:51.4335038Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:10:51.4335561Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:51.4336045Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:51.4336450Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:51.4336794Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:51.4337126Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.4337476Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:51.4337801Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:51.4338163Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:51.4338550Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:10:51.4338932Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.4339323Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:51.4339684Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.4340544Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4341337Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:10:51.4341634Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:51.4342000Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:51.4342383Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:10:51.4342759Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:10:51.4343306Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.4343654Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.4344304Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:51.4345019Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 
2025-05-07T20:10:51.4345575Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4346078Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4346606Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4347082Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:51.4347430Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.4347798Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:51.4348260Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:51.4348776Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:51.4349226Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:51.4349579Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.4349969Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:51.4350260Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:51.4350554Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.4350848Z U strstr@GLIBC_2.2.5 2025-05-07T20:10:51.4351132Z U tolower@GLIBC_2.2.5 2025-05-07T20:10:51.4351439Z U toupper@GLIBC_2.2.5 2025-05-07T20:10:51.4351813Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:10:51.4352244Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:51.4352620Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:51.4353016Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:51.4353422Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.4353844Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.4354334Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:51.4354698Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:51.4355076Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.4355478Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.4355811Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.4356136Z w __gmon_start__ 2025-05-07T20:10:51.4356415Z 
w __pthread_key_create 2025-05-07T20:10:51.4356742Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.4357079Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.4357420Z w pthread_once 2025-05-07T20:10:51.4357698Z w pthread_rwlock_rdlock 2025-05-07T20:10:51.4358015Z w pthread_rwlock_unlock 2025-05-07T20:10:51.4358310Z w pthread_rwlock_wrlock 2025-05-07T20:10:51.4358654Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:10:51.4359010Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.4359431Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.4359686Z 2025-05-07T20:10:51.4359833Z linux-vdso.so.1 (0x00007ffff3721000) 2025-05-07T20:10:51.4360129Z libc10.so => not found 2025-05-07T20:10:51.4360385Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4360647Z libc10_cuda.so => not found 2025-05-07T20:10:51.4360919Z libnccl.so.2 => not found 2025-05-07T20:10:51.4361165Z libcuda.so.1 => not found 2025-05-07T20:10:51.4361725Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f51c0f89000) 2025-05-07T20:10:51.4362312Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4362580Z libtorch.so => not found 2025-05-07T20:10:51.4362846Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4363110Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4363453Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f51c0d25000) 2025-05-07T20:10:51.4363840Z libm.so.6 => /lib64/libm.so.6 (0x00007f51c0c4a000) 2025-05-07T20:10:51.4364259Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f51c1575000) 2025-05-07T20:10:51.4364635Z libc.so.6 => /lib64/libc.so.6 (0x00007f51c0a42000) 2025-05-07T20:10:51.4365000Z /lib64/ld-linux-x86-64.so.2 (0x00007f51c15ab000) 2025-05-07T20:10:51.4365339Z libtorch.so => not found 2025-05-07T20:10:51.4365587Z libc10.so => not found 2025-05-07T20:10:51.4365841Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4366100Z libc10_cuda.so => not found 2025-05-07T20:10:51.4366371Z 
libnccl.so.2 => not found 2025-05-07T20:10:51.4366731Z libcuda.so.1 => not found 2025-05-07T20:10:51.4366980Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4367234Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4367492Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4367789Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f51c09ec000) 2025-05-07T20:10:51.4368168Z librt.so.1 => /lib64/librt.so.1 (0x00007f51c156c000) 2025-05-07T20:10:51.4368558Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f51c1567000) 2025-05-07T20:10:51.4368851Z 2025-05-07T20:10:51.4368954Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.4369313Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:10:51.4369572Z 2025-05-07T20:10:51.4383280Z 2025-05-07T20:10:51.4384067Z Dynamic section at offset 0x54d6c8 contains 40 entries: 2025-05-07T20:10:51.4385218Z Tag Type Name/Value 2025-05-07T20:10:51.4386441Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.4387925Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.4389429Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.4390895Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.4392431Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.4393223Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:10:51.4393738Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.4394385Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.4394890Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.4395425Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.4395953Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.4396454Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:51.4396958Z 
0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.4397447Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.4397968Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:51.4398598Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:10:51.4399101Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:51.4399517Z 0x000000000000000c (INIT) 0xff000 2025-05-07T20:10:51.4399852Z 0x000000000000000d (FINI) 0x4c1c58 2025-05-07T20:10:51.4400197Z 0x0000000000000019 (INIT_ARRAY) 0x54a1c0 2025-05-07T20:10:51.4400548Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:10:51.4400916Z 0x000000000000001a (FINI_ARRAY) 0x54a688 2025-05-07T20:10:51.4401283Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.4401634Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:51.4401959Z 0x0000000000000005 (STRTAB) 0x26de0 2025-05-07T20:10:51.4402292Z 0x0000000000000006 (SYMTAB) 0x9da0 2025-05-07T20:10:51.4402647Z 0x000000000000000a (STRSZ) 754246 (bytes) 2025-05-07T20:10:51.4403007Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.4403364Z 0x0000000000000003 (PLTGOT) 0x551fe8 2025-05-07T20:10:51.4403739Z 0x0000000000000002 (PLTRELSZ) 25992 (bytes) 2025-05-07T20:10:51.4404100Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.4404419Z 0x0000000000000017 (JMPREL) 0xf8458 2025-05-07T20:10:51.4404758Z 0x0000000000000007 (RELA) 0xe1838 2025-05-07T20:10:51.4405115Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:10:51.4405471Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.4405832Z 0x000000006ffffffe (VERNEED) 0xe16d8 2025-05-07T20:10:51.4406160Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.4406499Z 0x000000006ffffff0 (VERSYM) 0xdf026 2025-05-07T20:10:51.4406821Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:10:51.4407135Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.4407338Z 2025-05-07T20:10:51.4407451Z 
################################################################################ 2025-05-07T20:10:51.4407721Z 2025-05-07T20:10:51.4407728Z 2025-05-07T20:10:51.4407949Z ################################################################################ 2025-05-07T20:10:51.4408435Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4408900Z [CHECK] Listing out library size: 2025-05-07T20:10:51.4409379Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4409737Z 2025-05-07T20:10:51.4409961Z 3 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4410255Z 2025-05-07T20:10:51.4410626Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4411576Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.4412141Z 2025-05-07T20:10:51.4458405Z GLIBC_2.2.5 2025-05-07T20:10:51.4459060Z GLIBC_2.14 2025-05-07T20:10:51.4459420Z 2025-05-07T20:10:51.4459433Z 2025-05-07T20:10:51.4460028Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4461137Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.4461747Z 2025-05-07T20:10:51.4517370Z GLIBCXX_3.4 2025-05-07T20:10:51.4518022Z GLIBCXX_3.4.9 2025-05-07T20:10:51.4518595Z GLIBCXX_3.4.14 2025-05-07T20:10:51.4519189Z GLIBCXX_3.4.20 2025-05-07T20:10:51.4519736Z GLIBCXX_3.4.21 2025-05-07T20:10:51.4520095Z 2025-05-07T20:10:51.4520108Z 2025-05-07T20:10:51.4539774Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.qyqi0wPmMy.symbols.txt 
2025-05-07T20:10:51.4541513Z 2025-05-07T20:10:51.4570577Z 2025-05-07T20:10:51.4596514Z [CHECK] Total Number of symbols: 550 2025-05-07T20:10:51.4617810Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:10:51.4634230Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.hir3VPj2ps.usymbols.txt 2025-05-07T20:10:51.4634713Z 2025-05-07T20:10:51.4659792Z 2025-05-07T20:10:51.4690436Z [CHECK] Listing out undefined symbols (179 total): 2025-05-07T20:10:51.4715488Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.4717600Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.4718698Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.4719899Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.4721019Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.4722146Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.4722702Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.4723178Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.4723545Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.4723936Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:51.4724294Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.4724610Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.4724951Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.4725266Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:51.4725623Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.4725940Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.4726285Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.4726604Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.4726944Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:51.4727290Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.4727862Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:51.4728619Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 
2025-05-07T20:10:51.4729382Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:51.4730352Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.4731381Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:10:51.4731835Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:51.4732361Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:51.4733059Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:51.4734188Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:51.4735185Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:51.4736102Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.4736876Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:51.4737405Z U at::get_num_threads() 2025-05-07T20:10:51.4737735Z U at::get_thread_num() 2025-05-07T20:10:51.4738131Z U at::internal::set_thread_num(int) 2025-05-07T20:10:51.4738532Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:51.4738975Z U c10::BoolType::get() 2025-05-07T20:10:51.4739361Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.4740003Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:51.4740612Z U c10::Error::what() const 2025-05-07T20:10:51.4740997Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.4741470Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.4741925Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.4742282Z U c10::IntType::get() 2025-05-07T20:10:51.4742673Z U 
c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:51.4743100Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:51.4743593Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.4744089Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:51.4744451Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:51.4744855Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:51.4745256Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.4745935Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.4746612Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.4747165Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:51.4747571Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.4748041Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:51.4748468Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:51.4748808Z U c10::SymIntType::get() 2025-05-07T20:10:51.4749171Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:51.4749561Z U c10::TensorType::get() 2025-05-07T20:10:51.4749894Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.4750850Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.4751826Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.4752244Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:51.4752821Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:51.4753537Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:51.4754199Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.4754757Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.4755118Z U c10::cuda::GetDevice(signed char*) 
2025-05-07T20:10:51.4755580Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.4755939Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.4756453Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.4756967Z U c10::cuda::device_count() 2025-05-07T20:10:51.4757323Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.4757798Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.4758199Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.4758630Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.4759051Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.4759476Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.4760356Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.4761211Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.4762053Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.4762968Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.4764146Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.4764980Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.4765371Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.4765724Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.4766106Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:51.4766473Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:51.4766878Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.4767282Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.4767704Z U 
c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:51.4768116Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:51.4768679Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:51.4769071Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:51.4769499Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.4769980Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.4770400Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.4770794Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.4771197Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.4771558Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:51.4771949Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.4772323Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.4772723Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.4773090Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.4773464Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.4773835Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.4774207Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.4774605Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.4775787Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4777511Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4779085Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4780701Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4782376Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4784036Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4785750Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4787461Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4789173Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4790864Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4792584Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4794396Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.4795765Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:10:51.4796219Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:51.4796742Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:51.4797269Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.4797725Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.4798169Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.4798567Z U int* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.4799036Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.4799472Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.4799901Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.4800298Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.4800601Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.4801028Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.4801317Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:10:51.4801648Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:10:51.4801995Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.4802362Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.4802935Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.4803721Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.4804338Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.4804867Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:51.4806734Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.4807181Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.4807619Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:51.4808198Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.4809244Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4810100Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:51.4810500Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.4810870Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.4811259Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.4811614Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.4812102Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4812794Z 
U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.4813381Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.4813928Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:10:51.4814834Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:51.4815907Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:51.4816612Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.4816915Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.4817420Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.4818319Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.4819649Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.4820520Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.4821293Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.4821882Z U typeinfo for c10::Error 2025-05-07T20:10:51.4822257Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:51.4822697Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.4823187Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.4823638Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.4824116Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.4824525Z U vtable for c10::Error 2025-05-07T20:10:51.4825072Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.4825775Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.4826286Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:51.4826629Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.4826988Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.4827415Z w __gmon_start__ 2025-05-07T20:10:51.4827729Z w __pthread_key_create 2025-05-07T20:10:51.4828086Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.4828740Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4829159Z 2025-05-07T20:10:51.4829270Z linux-vdso.so.1 (0x00007ffc5326c000) 2025-05-07T20:10:51.4829640Z libc10.so => not found 2025-05-07T20:10:51.4829995Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4830275Z libc10_cuda.so => not found 2025-05-07T20:10:51.4830586Z libnccl.so.2 => not found 2025-05-07T20:10:51.4830866Z libcuda.so.1 => not found 2025-05-07T20:10:51.4831442Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f57bb600000) 2025-05-07T20:10:51.4832378Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f57bbbf1000) 2025-05-07T20:10:51.4833077Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4833399Z libtorch.so => not found 2025-05-07T20:10:51.4833674Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4833994Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4834361Z libcudart.so.12 => not found 2025-05-07T20:10:51.4834744Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f57bb39c000) 2025-05-07T20:10:51.4835187Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f57bbb99000) 2025-05-07T20:10:51.4835691Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f57bbb6b000) 2025-05-07T20:10:51.4836091Z libc.so.6 => /lib64/libc.so.6 (0x00007f57bb194000) 2025-05-07T20:10:51.4836462Z libc10.so => not found 2025-05-07T20:10:51.4836724Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4837016Z libc10_cuda.so => not found 2025-05-07T20:10:51.4837309Z libnccl.so.2 => not found 2025-05-07T20:10:51.4837573Z libcuda.so.1 => not found 
2025-05-07T20:10:51.4838138Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f57bb11d000) 2025-05-07T20:10:51.4838726Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4839037Z libtorch.so => not found 2025-05-07T20:10:51.4839299Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4839600Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4839905Z libm.so.6 => /lib64/libm.so.6 (0x00007f57bb042000) 2025-05-07T20:10:51.4840308Z /lib64/ld-linux-x86-64.so.2 (0x00007f57bbf26000) 2025-05-07T20:10:51.4840674Z libtorch.so => not found 2025-05-07T20:10:51.4840932Z libc10.so => not found 2025-05-07T20:10:51.4841213Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4841492Z libc10_cuda.so => not found 2025-05-07T20:10:51.4841785Z libnccl.so.2 => not found 2025-05-07T20:10:51.4842047Z libcuda.so.1 => not found 2025-05-07T20:10:51.4842340Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4842628Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4842932Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4843290Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f57bb03d000) 2025-05-07T20:10:51.4843715Z libtorch.so => not found 2025-05-07T20:10:51.4844001Z libc10.so => not found 2025-05-07T20:10:51.4844255Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.4844553Z libc10_cuda.so => not found 2025-05-07T20:10:51.4844824Z libnccl.so.2 => not found 2025-05-07T20:10:51.4845160Z libcuda.so.1 => not found 2025-05-07T20:10:51.4845436Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.4845749Z libtorch_cpu.so => not found 2025-05-07T20:10:51.4846147Z libtorch_cuda.so => not found 2025-05-07T20:10:51.4846501Z librt.so.1 => /lib64/librt.so.1 (0x00007f57bb038000) 2025-05-07T20:10:51.4846779Z 2025-05-07T20:10:51.4846916Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.4847347Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:51.4847711Z 2025-05-07T20:10:51.4847716Z 
2025-05-07T20:10:51.4847882Z Dynamic section at offset 0x2b5a90 contains 41 entries: 2025-05-07T20:10:51.4848303Z Tag Type Name/Value 2025-05-07T20:10:51.4848737Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.4849262Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.4849770Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.4850306Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.4850837Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.4851360Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:51.4851864Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:51.4852425Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.4852963Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.4853467Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.4854012Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.4854529Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.4855075Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.4855586Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.4856309Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.4856861Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.4857392Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:51.4857948Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:51.4858360Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:10:51.4858722Z 0x000000000000000d (FINI) 0x6243c 2025-05-07T20:10:51.4859071Z 0x0000000000000019 (INIT_ARRAY) 0x2b5a40 2025-05-07T20:10:51.4859448Z 0x000000000000001b 
(INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:10:51.4859830Z 0x000000000000001a (FINI_ARRAY) 0x2b5a88 2025-05-07T20:10:51.4860189Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.4860572Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.4860909Z 0x0000000000000005 (STRTAB) 0x40a0 2025-05-07T20:10:51.4861279Z 0x0000000000000006 (SYMTAB) 0xcf8 2025-05-07T20:10:51.4861640Z 0x000000000000000a (STRSZ) 48233 (bytes) 2025-05-07T20:10:51.4862035Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.4862428Z 0x0000000000000003 (PLTGOT) 0x2b6fe8 2025-05-07T20:10:51.4862798Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:10:51.4863197Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.4863534Z 0x0000000000000017 (JMPREL) 0x13a68 2025-05-07T20:10:51.4863896Z 0x0000000000000007 (RELA) 0x10258 2025-05-07T20:10:51.4864257Z 0x0000000000000008 (RELASZ) 14352 (bytes) 2025-05-07T20:10:51.4864631Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.4864967Z 0x000000006ffffffe (VERNEED) 0x10158 2025-05-07T20:10:51.4865466Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.4865791Z 0x000000006ffffff0 (VERSYM) 0xfd0a 2025-05-07T20:10:51.4866108Z 0x000000006ffffff9 (RELACOUNT) 337 2025-05-07T20:10:51.4866422Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.4866617Z 2025-05-07T20:10:51.4866728Z ################################################################################ 2025-05-07T20:10:51.4866966Z 2025-05-07T20:10:51.4866970Z 2025-05-07T20:10:51.4867080Z ################################################################################ 2025-05-07T20:10:51.4867565Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.4868082Z [CHECK] Listing out library size: 2025-05-07T20:10:51.4868640Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.4868989Z 2025-05-07T20:10:51.4869180Z 9 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.4869485Z 2025-05-07T20:10:51.4869855Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.4870819Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.4886458Z 2025-05-07T20:10:51.4898356Z GLIBC_2.2.5 2025-05-07T20:10:51.4898971Z GLIBC_2.3 2025-05-07T20:10:51.4899558Z GLIBC_2.14 2025-05-07T20:10:51.4899885Z 2025-05-07T20:10:51.4899898Z 2025-05-07T20:10:51.4901168Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.4904287Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.4906154Z 2025-05-07T20:10:51.4960749Z GLIBCXX_3.4 2025-05-07T20:10:51.4961391Z GLIBCXX_3.4.9 2025-05-07T20:10:51.4962002Z GLIBCXX_3.4.11 2025-05-07T20:10:51.4962597Z GLIBCXX_3.4.18 2025-05-07T20:10:51.4963193Z GLIBCXX_3.4.21 2025-05-07T20:10:51.4963835Z 2025-05-07T20:10:51.4963858Z 2025-05-07T20:10:51.4985201Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.41nzzCOJaO.symbols.txt 2025-05-07T20:10:51.4985687Z 2025-05-07T20:10:51.5018620Z 2025-05-07T20:10:51.5051509Z [CHECK] Total Number of symbols: 347 2025-05-07T20:10:51.5073519Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:10:51.5093293Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.Q5OmcBMQf6.usymbols.txt 2025-05-07T20:10:51.5094675Z 2025-05-07T20:10:51.5111799Z 2025-05-07T20:10:51.5147377Z [CHECK] Listing out undefined symbols (124 total): 2025-05-07T20:10:51.5159815Z U VTT for std::__cxx11::basic_ostringstream, 
std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5160690Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5161251Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.5161611Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5162024Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5162407Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5162799Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.5163190Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.5163545Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.5163920Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5164266Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.5164584Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.5164891Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.5167261Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.5167569Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:51.5167925Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.5168264Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:51.5168592Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:51.5169001Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:51.5169468Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:51.5170023Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:51.5170413Z U c10::BoolType::get() 2025-05-07T20:10:51.5170776Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.5171160Z U c10::FloatType::get() 2025-05-07T20:10:51.5171476Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:51.5171890Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5172391Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.5172756Z U c10::IntType::get() 2025-05-07T20:10:51.5173138Z U c10::MessageLogger::MessageLogger(char const*, int, int) 
2025-05-07T20:10:51.5173536Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:51.5173918Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.5174316Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:51.5175196Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.5175810Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.5176148Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.5176466Z U c10::TensorType::get() 2025-05-07T20:10:51.5176767Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.5177715Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.5178712Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.5179044Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.5179375Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.5179702Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:51.5180011Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.5180331Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.5180762Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.5181203Z U c10::cuda::device_count() 2025-05-07T20:10:51.5181511Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.5181873Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.5182236Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.5182589Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.5182967Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.5183311Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.5184007Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.5184835Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 
2025-05-07T20:10:51.5185661Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.5186542Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.5187506Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.5188250Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.5188604Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.5188913Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.5189264Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.5189630Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.5189949Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:51.5190330Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.5190759Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.5191109Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.5191435Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.5191764Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.5192096Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:51.5192401Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.5192732Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.5193074Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:51.5193429Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.5193738Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.5194065Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.5194470Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.5195010Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.5195445Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.5195787Z U float at::Tensor::item() const 2025-05-07T20:10:51.5196175Z U 
float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5196580Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5196990Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5197354Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.5197635Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.5197933Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.5198229Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.5198582Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.5199153Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.5199998Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.5200936Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:51.5201681Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:51.5202236Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.5202832Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:51.5203210Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.5203929Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.5204301Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:51.5204854Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.5205787Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5206580Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.5206942Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.5207340Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.5207693Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.5208089Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5208625Z U std::ostream& 
std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5209094Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.5209478Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.5209796Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.5210102Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.5210924Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.5212073Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.5212903Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.5213642Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.5214291Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.5214719Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.5215149Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.5215743Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5216413Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.5216867Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.5217200Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.5217510Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.5217819Z w __gmon_start__ 2025-05-07T20:10:51.5218105Z w __pthread_key_create 2025-05-07T20:10:51.5218517Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.5218958Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.5219314Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.5219759Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.5220066Z 2025-05-07T20:10:51.5220187Z linux-vdso.so.1 (0x00007ffc2c953000) 2025-05-07T20:10:51.5220462Z 
libtorch.so => not found 2025-05-07T20:10:51.5220709Z libc10.so => not found 2025-05-07T20:10:51.5220937Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.5221194Z libc10_cuda.so => not found 2025-05-07T20:10:51.5221433Z libnccl.so.2 => not found 2025-05-07T20:10:51.5221676Z libcuda.so.1 => not found 2025-05-07T20:10:51.5221912Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.5222175Z libtorch_cpu.so => not found 2025-05-07T20:10:51.5222416Z libtorch_cuda.so => not found 2025-05-07T20:10:51.5222696Z libcudart.so.12 => not found 2025-05-07T20:10:51.5223008Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f753019c000) 2025-05-07T20:10:51.5223396Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f7530e2b000) 2025-05-07T20:10:51.5223771Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7530dfd000) 2025-05-07T20:10:51.5224114Z libc.so.6 => /lib64/libc.so.6 (0x00007f752ff94000) 2025-05-07T20:10:51.5224450Z /lib64/ld-linux-x86-64.so.2 (0x00007f7530e89000) 2025-05-07T20:10:51.5224773Z libm.so.6 => /lib64/libm.so.6 (0x00007f7530d22000) 2025-05-07T20:10:51.5224994Z 2025-05-07T20:10:51.5225096Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.5225532Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:51.5225858Z 2025-05-07T20:10:51.5238744Z 2025-05-07T20:10:51.5240032Z Dynamic section at offset 0x8a7a10 contains 39 entries: 2025-05-07T20:10:51.5241168Z Tag Type Name/Value 2025-05-07T20:10:51.5242328Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.5242847Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.5243484Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.5244005Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.5244502Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.5245006Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.5245524Z 
0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.5246037Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.5246585Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.5247127Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.5247640Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.5248172Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.5248732Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.5249254Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.5249771Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:51.5250355Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:51.5250843Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:51.5251186Z 0x000000000000000d (FINI) 0x333cc 2025-05-07T20:10:51.5251540Z 0x0000000000000019 (INIT_ARRAY) 0x8a71f8 2025-05-07T20:10:51.5251887Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:10:51.5252260Z 0x000000000000001a (FINI_ARRAY) 0x8a7228 2025-05-07T20:10:51.5252612Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.5252978Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:51.5253327Z 0x0000000000000005 (STRTAB) 0x2a78 2025-05-07T20:10:51.5253650Z 0x0000000000000006 (SYMTAB) 0x9d8 2025-05-07T20:10:51.5254015Z 0x000000000000000a (STRSZ) 38407 (bytes) 2025-05-07T20:10:51.5254373Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.5254739Z 0x0000000000000003 (PLTGOT) 0x8a7fe8 2025-05-07T20:10:51.5255100Z 0x0000000000000002 (PLTRELSZ) 4728 (bytes) 2025-05-07T20:10:51.5255461Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.5255791Z 0x0000000000000017 (JMPREL) 0xe230 2025-05-07T20:10:51.5256138Z 0x0000000000000007 (RELA) 0xc448 2025-05-07T20:10:51.5256492Z 
0x0000000000000008 (RELASZ) 7656 (bytes) 2025-05-07T20:10:51.5256839Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.5257249Z 0x000000006ffffffe (VERNEED) 0xc338 2025-05-07T20:10:51.5257572Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.5257910Z 0x000000006ffffff0 (VERSYM) 0xc080 2025-05-07T20:10:51.5258239Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:10:51.5258560Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.5258780Z 2025-05-07T20:10:51.5258909Z ################################################################################ 2025-05-07T20:10:51.5259136Z 2025-05-07T20:10:51.5259140Z 2025-05-07T20:10:51.5259255Z ################################################################################ 2025-05-07T20:10:51.5259800Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5260276Z [CHECK] Listing out library size: 2025-05-07T20:10:51.5260737Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5261099Z 2025-05-07T20:10:51.5261309Z 21 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5261613Z 2025-05-07T20:10:51.5262021Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5263002Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.5263579Z 2025-05-07T20:10:51.5331132Z GLIBC_2.2.5 2025-05-07T20:10:51.5331747Z GLIBC_2.14 2025-05-07T20:10:51.5332098Z 2025-05-07T20:10:51.5332112Z 2025-05-07T20:10:51.5333336Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5336278Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 
's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.5338065Z 2025-05-07T20:10:51.5412402Z GLIBCXX_3.4 2025-05-07T20:10:51.5413064Z GLIBCXX_3.4.9 2025-05-07T20:10:51.5413637Z GLIBCXX_3.4.11 2025-05-07T20:10:51.5414238Z GLIBCXX_3.4.20 2025-05-07T20:10:51.5414794Z GLIBCXX_3.4.21 2025-05-07T20:10:51.5415413Z 2025-05-07T20:10:51.5415448Z 2025-05-07T20:10:51.5431076Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.GkrenAueYR.symbols.txt 2025-05-07T20:10:51.5431557Z 2025-05-07T20:10:51.5476248Z 2025-05-07T20:10:51.5502288Z [CHECK] Total Number of symbols: 783 2025-05-07T20:10:51.5515736Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:10:51.5531998Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.9YkvAHoZVm.usymbols.txt 2025-05-07T20:10:51.5532031Z 2025-05-07T20:10:51.5551311Z 2025-05-07T20:10:51.5577172Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:10:51.5593524Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5593655Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.5593879Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5594048Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5594298Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5594443Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.5594595Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.5594719Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.5594856Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5594962Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.5595091Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.5595194Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.5595298Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.5595420Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.5595679Z U __cxa_guard_release@CXXABI_1.3 
2025-05-07T20:10:51.5595780Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.5595898Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.5596048Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:51.5596229Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:51.5596695Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5597338Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5597989Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5598176Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:51.5598853Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5599065Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:51.5599443Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:51.5599922Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5600516Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5600726Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:51.5600975Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:51.5601091Z U c10::BoolType::get() 2025-05-07T20:10:51.5601242Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.5601355Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:51.5601534Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5601670Z U 
c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.5601765Z U c10::IntType::get() 2025-05-07T20:10:51.5601993Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.5602141Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.5602273Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.5602669Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.5602801Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.5602909Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.5603072Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.5603168Z U c10::TensorType::get() 2025-05-07T20:10:51.5603286Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.5603963Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.5604119Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.5604234Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.5604367Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.5604473Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:51.5604585Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.5604692Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.5604934Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.5605059Z U c10::cuda::current_device() 2025-05-07T20:10:51.5605160Z U c10::cuda::device_count() 2025-05-07T20:10:51.5605304Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.5605431Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.5605566Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.5605710Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.5605885Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.5605993Z U 
c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.5606483Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.5606719Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.5607178Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.5607512Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.5608055Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.5608241Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.5608348Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.5608491Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:51.5608654Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:51.5608781Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.5608919Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.5609053Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.5609177Z U c10::throwNullDataPtrError() 2025-05-07T20:10:51.5609277Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:51.5609387Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:51.5609586Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.5609699Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:51.5609827Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:51.5609974Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.5610102Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.5610213Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.5610338Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.5610468Z U cudaEventQuery@libcudart.so.12 
2025-05-07T20:10:51.5610579Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.5610697Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.5610839Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:51.5610976Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:51.5611093Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:51.5611238Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.5611352Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.5611462Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.5611578Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:51.5611717Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:51.5611998Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:51.5612148Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:51.5612279Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:51.5612392Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.5612515Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.5612659Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.5612843Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5612966Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.5613125Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5613223Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:51.5613395Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.5613530Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.5613711Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5613811Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.5613914Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.5614024Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.5614132Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.5614250Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.5614625Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 
2025-05-07T20:10:51.5615003Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.5615124Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.5615293Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.5615435Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.5615606Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:51.5615768Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:51.5616004Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.5616550Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5616706Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.5616832Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.5616954Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.5617095Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.5617277Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5617509Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.5617644Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.5617745Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.5617902Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.5618478Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.5618913Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.5619163Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.5619546Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.5619757Z U unsigned char* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.5619906Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.5620072Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.5620245Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.5620552Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5620779Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.5620889Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.5620993Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.5621104Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.5621193Z w __gmon_start__ 2025-05-07T20:10:51.5621286Z w __pthread_key_create 2025-05-07T20:10:51.5621390Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.5621510Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.5621649Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.5621839Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5621870Z 2025-05-07T20:10:51.5633250Z linux-vdso.so.1 (0x00007ffdbf3a5000) 2025-05-07T20:10:51.5633370Z libtorch.so => not found 2025-05-07T20:10:51.5633460Z libc10.so => not found 2025-05-07T20:10:51.5633570Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.5633661Z libc10_cuda.so => not found 2025-05-07T20:10:51.5633752Z libnccl.so.2 => not found 2025-05-07T20:10:51.5633852Z libcuda.so.1 => not found 2025-05-07T20:10:51.5633952Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.5634051Z libtorch_cpu.so => not found 2025-05-07T20:10:51.5634223Z libtorch_cuda.so => not found 2025-05-07T20:10:51.5634333Z libcudart.so.12 => not found 2025-05-07T20:10:51.5634494Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f572879c000) 2025-05-07T20:10:51.5634618Z libm.so.6 => /lib64/libm.so.6 (0x00007f572a169000) 2025-05-07T20:10:51.5634783Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f572a113000) 
2025-05-07T20:10:51.5634935Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f572a0e5000) 2025-05-07T20:10:51.5635058Z libc.so.6 => /lib64/libc.so.6 (0x00007f5728594000) 2025-05-07T20:10:51.5635185Z /lib64/ld-linux-x86-64.so.2 (0x00007f572a24c000) 2025-05-07T20:10:51.5635913Z 2025-05-07T20:10:51.5636053Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.5636553Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:51.5636566Z 2025-05-07T20:10:51.5668002Z 2025-05-07T20:10:51.5668689Z Dynamic section at offset 0x14b76f0 contains 39 entries: 2025-05-07T20:10:51.5669102Z Tag Type Name/Value 2025-05-07T20:10:51.5669742Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.5670314Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.5670898Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.5671735Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.5672335Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.5672889Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.5673491Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.5673912Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.5674220Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.5675792Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.5676016Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.5676206Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:51.5676404Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.5676602Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.5676852Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 
2025-05-07T20:10:51.5677076Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:51.5677189Z 0x000000000000000c (INIT) 0x2d000 2025-05-07T20:10:51.5677316Z 0x000000000000000d (FINI) 0xd6d2c 2025-05-07T20:10:51.5677438Z 0x0000000000000019 (INIT_ARRAY) 0x14b5318 2025-05-07T20:10:51.5677567Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:10:51.5677705Z 0x000000000000001a (FINI_ARRAY) 0x14b53e8 2025-05-07T20:10:51.5677826Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.5677945Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.5678057Z 0x0000000000000005 (STRTAB) 0x5fa8 2025-05-07T20:10:51.5678180Z 0x0000000000000006 (SYMTAB) 0x1628 2025-05-07T20:10:51.5678316Z 0x000000000000000a (STRSZ) 113302 (bytes) 2025-05-07T20:10:51.5678433Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.5678601Z 0x0000000000000003 (PLTGOT) 0x14b7fe8 2025-05-07T20:10:51.5678737Z 0x0000000000000002 (PLTRELSZ) 10368 (bytes) 2025-05-07T20:10:51.5678842Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.5678967Z 0x0000000000000017 (JMPREL) 0x29e58 2025-05-07T20:10:51.5679072Z 0x0000000000000007 (RELA) 0x22160 2025-05-07T20:10:51.5679199Z 0x0000000000000008 (RELASZ) 31992 (bytes) 2025-05-07T20:10:51.5679317Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.5679450Z 0x000000006ffffffe (VERNEED) 0x22060 2025-05-07T20:10:51.5679560Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.5679673Z 0x000000006ffffff0 (VERSYM) 0x21a3e 2025-05-07T20:10:51.5679795Z 0x000000006ffffff9 (RELACOUNT) 498 2025-05-07T20:10:51.5679895Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.5679901Z 2025-05-07T20:10:51.5680021Z ################################################################################ 2025-05-07T20:10:51.5680026Z 2025-05-07T20:10:51.5680030Z 2025-05-07T20:10:51.5680155Z ################################################################################ 2025-05-07T20:10:51.5680419Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.5680521Z [CHECK] Listing out library size: 2025-05-07T20:10:51.5680790Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.5680798Z 2025-05-07T20:10:51.5680996Z 17 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.5681001Z 2025-05-07T20:10:51.5681725Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.5682264Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.5682271Z 2025-05-07T20:10:51.5739375Z GLIBC_2.2.5 2025-05-07T20:10:51.5739621Z GLIBC_2.14 2025-05-07T20:10:51.5740114Z 2025-05-07T20:10:51.5740392Z 2025-05-07T20:10:51.5741431Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.5741976Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.5741982Z 2025-05-07T20:10:51.5801491Z GLIBCXX_3.4 2025-05-07T20:10:51.5801765Z GLIBCXX_3.4.9 2025-05-07T20:10:51.5802035Z GLIBCXX_3.4.20 2025-05-07T20:10:51.5802260Z GLIBCXX_3.4.21 2025-05-07T20:10:51.5802290Z 2025-05-07T20:10:51.5802432Z 2025-05-07T20:10:51.5824249Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.mz93v2oZYn.symbols.txt 2025-05-07T20:10:51.5824305Z 2025-05-07T20:10:51.5851666Z 2025-05-07T20:10:51.5876671Z [CHECK] Total Number of symbols: 452 2025-05-07T20:10:51.5891983Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:10:51.5909590Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.ItjzDeZAVo.usymbols.txt 2025-05-07T20:10:51.5909639Z 
2025-05-07T20:10:51.5924767Z 2025-05-07T20:10:51.5961759Z [CHECK] Listing out undefined symbols (149 total): 2025-05-07T20:10:51.5985261Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.5985750Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.5986204Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5986617Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.5986989Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5987391Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.5987781Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.5990036Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.5990425Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.5990771Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:51.5991050Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.5991344Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.5991656Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.5991952Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.5992243Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.5992531Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.5992822Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.5993092Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:51.5993386Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.5993649Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:51.5993813Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:51.5994507Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5995173Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5995429Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 
2025-05-07T20:10:51.5995606Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:51.5995859Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:51.5996083Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:51.5996196Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:51.5996690Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5997305Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.5997428Z U c10::BoolType::get() 2025-05-07T20:10:51.5997590Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.5997733Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.5997848Z U c10::IntType::get() 2025-05-07T20:10:51.5998048Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:51.5998174Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:51.5998401Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.5998572Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.5998711Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.5999118Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.5999268Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.5999385Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.5999497Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:51.5999614Z U c10::SymIntType::get() 2025-05-07T20:10:51.5999765Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:51.5999951Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.6000059Z U c10::TensorType::get() 2025-05-07T20:10:51.6000288Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.6001072Z U 
c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.6001210Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.6001316Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.6001426Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.6001540Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:51.6001646Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.6001753Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.6001993Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.6002090Z U c10::cuda::current_device() 2025-05-07T20:10:51.6002180Z U c10::cuda::device_count() 2025-05-07T20:10:51.6002315Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.6002438Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.6002570Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.6002709Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.6002853Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.6002952Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.6003456Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.6003691Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.6004152Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.6004473Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.6005027Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.6005136Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.6005242Z U 
c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.6005380Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:51.6005557Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:51.6005674Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.6005802Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.6005923Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.6006036Z U c10::throwNullDataPtrError() 2025-05-07T20:10:51.6006135Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:51.6006247Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:51.6006449Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.6006563Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:51.6006691Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:51.6006813Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.6006982Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.6007102Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.6007221Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.6007345Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:51.6007456Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.6007574Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.6007713Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:51.6007819Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:51.6007934Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:51.6008065Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:51.6008190Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.6008299Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.6008408Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.6008532Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:51.6008638Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:51.6008911Z U 
cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:51.6009038Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:51.6009139Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:51.6009242Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.6009365Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.6009491Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.6009606Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6009743Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.6009865Z U log2@GLIBC_2.2.5 2025-05-07T20:10:51.6010030Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.6010156Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6010307Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.6010396Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.6010486Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.6010589Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.6010696Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.6010829Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.6011150Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.6011526Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.6011636Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.6011788Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.6011933Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.6012095Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:51.6012316Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.6012864Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.6012988Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:51.6013116Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.6013232Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.6013366Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.6013474Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.6013659Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.6013879Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.6013999Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.6014119Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.6014219Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.6014336Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.6014918Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.6015355Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.6015612Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.6015947Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.6016070Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:51.6016229Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.6016379Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.6016529Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.6016851Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.6017083Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.6017197Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.6017314Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.6017413Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.6017503Z w __gmon_start__ 2025-05-07T20:10:51.6017593Z w 
__pthread_key_create 2025-05-07T20:10:51.6017743Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.6017967Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.6017974Z 2025-05-07T20:10:51.6035231Z linux-vdso.so.1 (0x00007ffc8c6d8000) 2025-05-07T20:10:51.6035537Z libtorch.so => not found 2025-05-07T20:10:51.6035812Z libc10.so => not found 2025-05-07T20:10:51.6036081Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.6036370Z libc10_cuda.so => not found 2025-05-07T20:10:51.6036631Z libnccl.so.2 => not found 2025-05-07T20:10:51.6036885Z libcuda.so.1 => not found 2025-05-07T20:10:51.6037188Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.6037602Z libtorch_cpu.so => not found 2025-05-07T20:10:51.6037874Z libtorch_cuda.so => not found 2025-05-07T20:10:51.6038140Z libcudart.so.12 => not found 2025-05-07T20:10:51.6038641Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff2dd19c000) 2025-05-07T20:10:51.6039045Z libm.so.6 => /lib64/libm.so.6 (0x00007ff2de5e8000) 2025-05-07T20:10:51.6039478Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ff2de592000) 2025-05-07T20:10:51.6039922Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff2de564000) 2025-05-07T20:10:51.6040276Z libc.so.6 => /lib64/libc.so.6 (0x00007ff2dcf94000) 2025-05-07T20:10:51.6040643Z /lib64/ld-linux-x86-64.so.2 (0x00007ff2de6cb000) 2025-05-07T20:10:51.6040900Z 2025-05-07T20:10:51.6041581Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.6042533Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:51.6042600Z 2025-05-07T20:10:51.6076217Z 2025-05-07T20:10:51.6076667Z Dynamic section at offset 0x104fa28 contains 39 entries: 2025-05-07T20:10:51.6076828Z Tag Type Name/Value 2025-05-07T20:10:51.6077170Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.6077517Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.6077884Z 0x0000000000000001 (NEEDED) Shared 
library: [libnvrtc.so.12] 2025-05-07T20:10:51.6078227Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.6078580Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.6078947Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.6079285Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.6079604Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.6079915Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.6080266Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.6080602Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.6080962Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:51.6081282Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:51.6081482Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.6081689Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.6081909Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:51.6082024Z 0x000000000000000c (INIT) 0x11000 2025-05-07T20:10:51.6082269Z 0x000000000000000d (FINI) 0x8746c 2025-05-07T20:10:51.6082389Z 0x0000000000000019 (INIT_ARRAY) 0x104ff20 2025-05-07T20:10:51.6082517Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:10:51.6082638Z 0x000000000000001a (FINI_ARRAY) 0x104ff80 2025-05-07T20:10:51.6082772Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.6082883Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.6082992Z 0x0000000000000005 (STRTAB) 0x3660 2025-05-07T20:10:51.6083116Z 0x0000000000000006 (SYMTAB) 0xbe8 2025-05-07T20:10:51.6083247Z 0x000000000000000a (STRSZ) 35790 (bytes) 2025-05-07T20:10:51.6083397Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.6083530Z 0x0000000000000003 (PLTGOT) 
0x1050fe8 2025-05-07T20:10:51.6083659Z 0x0000000000000002 (PLTRELSZ) 6480 (bytes) 2025-05-07T20:10:51.6083766Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.6083877Z 0x0000000000000017 (JMPREL) 0xf060 2025-05-07T20:10:51.6084000Z 0x0000000000000007 (RELA) 0xc6a8 2025-05-07T20:10:51.6084159Z 0x0000000000000008 (RELASZ) 10680 (bytes) 2025-05-07T20:10:51.6084277Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.6084404Z 0x000000006ffffffe (VERNEED) 0xc5b8 2025-05-07T20:10:51.6084512Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:51.6084623Z 0x000000006ffffff0 (VERSYM) 0xc22e 2025-05-07T20:10:51.6084735Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:10:51.6084852Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.6084858Z 2025-05-07T20:10:51.6084976Z ################################################################################ 2025-05-07T20:10:51.6084981Z 2025-05-07T20:10:51.6084985Z 2025-05-07T20:10:51.6085117Z ################################################################################ 2025-05-07T20:10:51.6085430Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6085538Z [CHECK] Listing out library size: 2025-05-07T20:10:51.6085865Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6085886Z 2025-05-07T20:10:51.6094322Z 2 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6094332Z 2025-05-07T20:10:51.6095102Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6096012Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.6096021Z 2025-05-07T20:10:51.6155988Z GLIBC_2.2.5 2025-05-07T20:10:51.6156137Z GLIBC_2.14 
2025-05-07T20:10:51.6156503Z 2025-05-07T20:10:51.6156510Z 2025-05-07T20:10:51.6157312Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6158095Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.6158105Z 2025-05-07T20:10:51.6215663Z GLIBCXX_3.4 2025-05-07T20:10:51.6215754Z GLIBCXX_3.4.9 2025-05-07T20:10:51.6215841Z GLIBCXX_3.4.21 2025-05-07T20:10:51.6215906Z 2025-05-07T20:10:51.6215966Z 2025-05-07T20:10:51.6235549Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.Lhqz2vQBub.symbols.txt 2025-05-07T20:10:51.6235562Z 2025-05-07T20:10:51.6257968Z 2025-05-07T20:10:51.6291859Z [CHECK] Total Number of symbols: 277 2025-05-07T20:10:51.6304587Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:10:51.6319474Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.cqgBXegPJF.usymbols.txt 2025-05-07T20:10:51.6321223Z 2025-05-07T20:10:51.6337507Z 2025-05-07T20:10:51.6364360Z [CHECK] Listing out undefined symbols (127 total): 2025-05-07T20:10:51.6381832Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.6383461Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.6384510Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.6384960Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.6385344Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.6385738Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.6386258Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.6386610Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.6386992Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.6387339Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:10:51.6387675Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.6387983Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.6388360Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.6388700Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.6389020Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.6389442Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:51.6390336Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.6391683Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.6392630Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:51.6393029Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:51.6393779Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.6394659Z U at::get_thread_num() 2025-05-07T20:10:51.6394967Z U at::internal::set_thread_num(int) 2025-05-07T20:10:51.6395776Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.6396728Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:51.6397312Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6397773Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.6398188Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6398608Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.6398986Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:51.6399358Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:51.6399760Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.6400168Z U c10::SymBool::guard_bool(char const*, long) const 
2025-05-07T20:10:51.6400555Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.6400926Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:51.6401349Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.6401723Z U c10::TensorType::get() 2025-05-07T20:10:51.6402095Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.6403057Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.6404013Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.6404390Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.6404748Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.6405083Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:51.6405505Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.6405847Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.6406333Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.6406897Z U c10::cuda::device_count() 2025-05-07T20:10:51.6407230Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.6407628Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.6407987Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.6408358Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.6408729Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.6409241Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.6409934Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.6410741Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.6411546Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.6412451Z U c10::detail::torchInternalAssertFail(char 
const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.6412973Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.6413291Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.6413628Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:51.6414034Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:51.6414413Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.6414755Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.6415125Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.6415453Z U c10::throwNullDataPtrError() 2025-05-07T20:10:51.6415766Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:51.6416060Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:51.6416454Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.6416859Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:51.6417182Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:51.6417533Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.6417872Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.6418221Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.6418545Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.6418874Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:51.6419194Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.6419515Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.6419890Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:51.6420206Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:51.6420535Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:51.6420857Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.6421185Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.6421496Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.6422162Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:51.6422689Z U 
cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:51.6423224Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:51.6423561Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.6423904Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.6424264Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.6424633Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6425015Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6425453Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.6425866Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.6426199Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.6426471Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.6426759Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.6427051Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.6427398Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.6427977Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.6428993Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.6429613Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.6430026Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.6430422Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.6430916Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.6431836Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.6432636Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.6432981Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.6433335Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.6433677Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.6434117Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.6434539Z U std::terminate()@GLIBCXX_3.4 
2025-05-07T20:10:51.6434843Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.6435160Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.6435980Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.6437130Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.6437956Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.6438683Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.6439341Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.6439768Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.6440188Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.6440791Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.6441459Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.6441932Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.6442266Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.6442576Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.6442877Z w __gmon_start__ 2025-05-07T20:10:51.6443146Z w __pthread_key_create 2025-05-07T20:10:51.6443490Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.6443979Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6444328Z 2025-05-07T20:10:51.6444492Z linux-vdso.so.1 (0x00007ffc311f8000) 2025-05-07T20:10:51.6444786Z libc10.so => not found 2025-05-07T20:10:51.6445025Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.6445294Z libc10_cuda.so => not found 2025-05-07T20:10:51.6445551Z libnccl.so.2 => not found 2025-05-07T20:10:51.6445812Z libcuda.so.1 => not found 2025-05-07T20:10:51.6446653Z fbgemm_gpu_tbe_utils.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f145ae00000) 2025-05-07T20:10:51.6447272Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.6447526Z libtorch.so => not found 2025-05-07T20:10:51.6447753Z libtorch_cpu.so => not found 2025-05-07T20:10:51.6448002Z libtorch_cuda.so => not found 2025-05-07T20:10:51.6448237Z libcudart.so.12 => not found 2025-05-07T20:10:51.6448548Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f145ab9c000) 2025-05-07T20:10:51.6448931Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f145bf3d000) 2025-05-07T20:10:51.6449307Z libc.so.6 => /lib64/libc.so.6 (0x00007f145a994000) 2025-05-07T20:10:51.6449610Z libtorch.so => not found 2025-05-07T20:10:51.6449826Z libc10.so => not found 2025-05-07T20:10:51.6450051Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.6450279Z libc10_cuda.so => not found 2025-05-07T20:10:51.6450521Z libnccl.so.2 => not found 2025-05-07T20:10:51.6450740Z libcuda.so.1 => not found 2025-05-07T20:10:51.6450977Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.6451218Z libtorch_cpu.so => not found 2025-05-07T20:10:51.6451463Z libtorch_cuda.so => not found 2025-05-07T20:10:51.6451703Z libcudart.so.12 => not found 2025-05-07T20:10:51.6451980Z libm.so.6 => /lib64/libm.so.6 (0x00007f145a8b9000) 2025-05-07T20:10:51.6452332Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f145a863000) 2025-05-07T20:10:51.6452678Z /lib64/ld-linux-x86-64.so.2 (0x00007f145c11a000) 2025-05-07T20:10:51.6452893Z 2025-05-07T20:10:51.6453004Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.6453421Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:51.6453772Z 2025-05-07T20:10:51.6460827Z 2025-05-07T20:10:51.6460987Z Dynamic section at offset 0x16eba8 contains 39 entries: 2025-05-07T20:10:51.6461374Z Tag Type Name/Value 2025-05-07T20:10:51.6462178Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.6462760Z 
0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.6463331Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.6463848Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.6464343Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.6464980Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:51.6465524Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.6466047Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.6466559Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.6467070Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.6467593Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.6468099Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.6468644Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.6469143Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.6469681Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:51.6470231Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:51.6470622Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:10:51.6470999Z 0x000000000000000d (FINI) 0x1a14c 2025-05-07T20:10:51.6471318Z 0x0000000000000019 (INIT_ARRAY) 0x16f890 2025-05-07T20:10:51.6471664Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:10:51.6471998Z 0x000000000000001a (FINI_ARRAY) 0x16f8b0 2025-05-07T20:10:51.6472343Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.6472684Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.6473009Z 0x0000000000000005 (STRTAB) 0x2108 2025-05-07T20:10:51.6473334Z 0x0000000000000006 (SYMTAB) 0x6f8 2025-05-07T20:10:51.6473662Z 0x000000000000000a (STRSZ) 20443 
(bytes) 2025-05-07T20:10:51.6474019Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.6474457Z 0x0000000000000003 (PLTGOT) 0x16ffe8 2025-05-07T20:10:51.6474819Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:10:51.6475166Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.6475573Z 0x0000000000000017 (JMPREL) 0x8150 2025-05-07T20:10:51.6475901Z 0x0000000000000007 (RELA) 0x73d0 2025-05-07T20:10:51.6476229Z 0x0000000000000008 (RELASZ) 3456 (bytes) 2025-05-07T20:10:51.6476587Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.6476920Z 0x000000006ffffffe (VERNEED) 0x7310 2025-05-07T20:10:51.6477256Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:51.6477576Z 0x000000006ffffff0 (VERSYM) 0x70e4 2025-05-07T20:10:51.6477903Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:10:51.6478213Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.6478415Z 2025-05-07T20:10:51.6478545Z ################################################################################ 2025-05-07T20:10:51.6478778Z 2025-05-07T20:10:51.6478782Z 2025-05-07T20:10:51.6478891Z ################################################################################ 2025-05-07T20:10:51.6479436Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.6479983Z [CHECK] Listing out library size: 2025-05-07T20:10:51.6480490Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.6480908Z 2025-05-07T20:10:51.6481151Z 11 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.6481519Z 2025-05-07T20:10:51.6481957Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.6483040Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 
's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.6485068Z 2025-05-07T20:10:51.6939821Z GLIBC_2.2.5 2025-05-07T20:10:51.6940447Z GLIBC_2.3 2025-05-07T20:10:51.6940987Z GLIBC_2.14 2025-05-07T20:10:51.6941392Z 2025-05-07T20:10:51.6941400Z 2025-05-07T20:10:51.6941865Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.6943001Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.6943658Z 2025-05-07T20:10:51.7401424Z GLIBCXX_3.4 2025-05-07T20:10:51.7402058Z GLIBCXX_3.4.9 2025-05-07T20:10:51.7402944Z GLIBCXX_3.4.11 2025-05-07T20:10:51.7403536Z GLIBCXX_3.4.15 2025-05-07T20:10:51.7404081Z GLIBCXX_3.4.18 2025-05-07T20:10:51.7404657Z GLIBCXX_3.4.20 2025-05-07T20:10:51.7405199Z GLIBCXX_3.4.21 2025-05-07T20:10:51.7405575Z 2025-05-07T20:10:51.7405589Z 2025-05-07T20:10:51.7425510Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.9ieqWCMN87.symbols.txt 2025-05-07T20:10:51.7427134Z 2025-05-07T20:10:51.7833084Z 2025-05-07T20:10:51.7863128Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:10:51.7891920Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:51.7909846Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.ycgzW2nHPD.usymbols.txt 2025-05-07T20:10:51.7911474Z 2025-05-07T20:10:51.7938803Z 2025-05-07T20:10:51.7964287Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:10:51.7981593Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.7982482Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.7983061Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.7983409Z U __cxa_allocate_exception@CXXABI_1.3 
2025-05-07T20:10:51.7983748Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.7984261Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.7984589Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.7984931Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:51.7985260Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.7985594Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.7985927Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.7986240Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.7986548Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:51.7986857Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.7987189Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:51.7987562Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:51.7987989Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:51.7988347Z U at::RecordFunction::end() 2025-05-07T20:10:51.7988798Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:51.7989285Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:51.7989792Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:51.7990459Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:51.7991159Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:51.7991749Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:51.7992659Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.7993638Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:51.7994180Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:51.7994778Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:51.7995179Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:51.7995675Z U 
at::sequence_number::get_and_increment() 2025-05-07T20:10:51.7996004Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:51.7996369Z U c10::AnyType::get() 2025-05-07T20:10:51.7996678Z U c10::BoolType::get() 2025-05-07T20:10:51.7997033Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.7997500Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:51.7997915Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:51.7998722Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:51.8000009Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:51.8001119Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:51.8001726Z U c10::Error::what() const 2025-05-07T20:10:51.8002028Z U c10::FloatType::get() 2025-05-07T20:10:51.8002351Z U c10::GradMode::is_enabled() 2025-05-07T20:10:51.8002686Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:51.8003064Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:51.8003473Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:51.8003844Z U c10::IValue::isBoolList() const 2025-05-07T20:10:51.8004201Z U c10::IValue::isDoubleList() const 2025-05-07T20:10:51.8004541Z U c10::IValue::isIntList() const 2025-05-07T20:10:51.8004866Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:51.8005214Z U c10::IValue::isTensorList() const 2025-05-07T20:10:51.8005573Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.8005940Z U c10::IntType::get() 2025-05-07T20:10:51.8006624Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.8007493Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:51.8007901Z U c10::MessageLogger::~MessageLogger() 
2025-05-07T20:10:51.8008238Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:51.8008596Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:51.8009021Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.8009611Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:51.8010088Z U c10::StringType::get() 2025-05-07T20:10:51.8010420Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:51.8010816Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.8011192Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:51.8011598Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:51.8012245Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.8012918Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.8013297Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:51.8013653Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:51.8014020Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.8014376Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:51.8014712Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:51.8015043Z U c10::SymIntType::get() 2025-05-07T20:10:51.8015382Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:51.8015722Z U c10::TensorType::get() 2025-05-07T20:10:51.8016031Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.8016672Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.8017701Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.8018525Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.8019345Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.8020236Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.8021241Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.8022183Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:51.8022812Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:51.8023226Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:51.8023591Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:51.8024200Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:51.8024768Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:51.8025157Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:51.8025575Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:51.8025947Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:51.8026383Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.8026785Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:51.8027256Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:51.8027883Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:51.8028524Z U free@GLIBC_2.2.5 2025-05-07T20:10:51.8029078Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.8029499Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:51.8029879Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.8030164Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.8030469Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.8030771Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.8031144Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.8031615Z U 
realloc@GLIBC_2.2.5 2025-05-07T20:10:51.8032025Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:51.8032723Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.8033567Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.8034527Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:51.8035406Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.8036261Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:51.8036872Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.8037225Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:51.8037621Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.8038030Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.8038455Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:51.8038890Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:51.8039287Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:51.8039785Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.8040729Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.8041541Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:51.8041962Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.8042326Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.8042679Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.8043033Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.8043433Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.8043981Z U std::ostream& 
std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.8044475Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.8044870Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:51.8045298Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:51.8045966Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:51.8046654Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:51.8047030Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.8047344Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:51.8047648Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.8047960Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.8048806Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.8050219Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.8051009Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.8051487Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:51.8051980Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:51.8052542Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:51.8053032Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:51.8053504Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:51.8054154Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:51.8054741Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:51.8055158Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:51.8055624Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 
2025-05-07T20:10:51.8056013Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:51.8056379Z U torch::autograd::Node::metadata() 2025-05-07T20:10:51.8056711Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:51.8057191Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:51.8057791Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:51.8058280Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:51.8058729Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:51.8059240Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:51.8062063Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:51.8064833Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:51.8065225Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:51.8065642Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:51.8066658Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:51.8067639Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:51.8068286Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:51.8069128Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.8069672Z U typeinfo for c10::Error 2025-05-07T20:10:51.8070019Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:51.8070414Z U typeinfo for 
std::exception@GLIBCXX_3.4 2025-05-07T20:10:51.8070780Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:51.8071137Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:51.8071502Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:51.8088201Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.8088759Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.8089415Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:51.8090253Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.8090705Z U vtable for c10::Error 2025-05-07T20:10:51.8091255Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.8091866Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:51.8092362Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.8092862Z U vtable for torch::autograd::Node 2025-05-07T20:10:51.8093267Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:51.8093662Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.8093989Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.8094313Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.8094608Z w __gmon_start__ 2025-05-07T20:10:51.8094899Z w __pthread_key_create 2025-05-07T20:10:51.8095200Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.8095543Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.8095910Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.8096437Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.8096814Z 2025-05-07T20:10:51.8096996Z linux-vdso.so.1 (0x00007ffc8bfbb000) 2025-05-07T20:10:51.8097342Z libc10.so => not found 2025-05-07T20:10:51.8097589Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8097867Z libc10_cuda.so => not found 2025-05-07T20:10:51.8098123Z libnccl.so.2 => not found 2025-05-07T20:10:51.8098385Z 
libcuda.so.1 => not found 2025-05-07T20:10:51.8099010Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f7a2d400000) 2025-05-07T20:10:51.8100058Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f7a2d000000) 2025-05-07T20:10:51.8101190Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f7a2ce59000) 2025-05-07T20:10:51.8102101Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8102382Z libtorch.so => not found 2025-05-07T20:10:51.8102984Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f7a2f78e000) 2025-05-07T20:10:51.8103746Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8104009Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8104345Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7a2cbf5000) 2025-05-07T20:10:51.8104760Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7a2f760000) 2025-05-07T20:10:51.8105130Z libc.so.6 => /lib64/libc.so.6 (0x00007f7a2c9ed000) 2025-05-07T20:10:51.8105489Z /lib64/ld-linux-x86-64.so.2 (0x00007f7a2f7a1000) 2025-05-07T20:10:51.8105805Z libtorch.so => not found 2025-05-07T20:10:51.8106055Z libc10.so => not found 2025-05-07T20:10:51.8106290Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8106557Z libc10_cuda.so => not found 2025-05-07T20:10:51.8106800Z libnccl.so.2 => not found 2025-05-07T20:10:51.8107063Z libcuda.so.1 => not found 2025-05-07T20:10:51.8107367Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8107631Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8107904Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8108167Z libcudart.so.12 => not found 2025-05-07T20:10:51.8108466Z libm.so.6 => /lib64/libm.so.6 (0x00007f7a2f681000) 2025-05-07T20:10:51.8108839Z libgomp.so.1 => 
/lib64/libgomp.so.1 (0x00007f7a2ebaa000) 2025-05-07T20:10:51.8109182Z libc10.so => not found 2025-05-07T20:10:51.8109422Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8109685Z libc10_cuda.so => not found 2025-05-07T20:10:51.8109939Z libnccl.so.2 => not found 2025-05-07T20:10:51.8110190Z libcuda.so.1 => not found 2025-05-07T20:10:51.8110740Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f7a2c400000) 2025-05-07T20:10:51.8111301Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8111579Z libtorch.so => not found 2025-05-07T20:10:51.8111821Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8112096Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8112352Z libcudart.so.12 => not found 2025-05-07T20:10:51.8112614Z libc10.so => not found 2025-05-07T20:10:51.8112848Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8113138Z libc10_cuda.so => not found 2025-05-07T20:10:51.8113387Z libnccl.so.2 => not found 2025-05-07T20:10:51.8113648Z libcuda.so.1 => not found 2025-05-07T20:10:51.8114389Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f7a2b200000) 2025-05-07T20:10:51.8115232Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8115598Z libtorch.so => not found 2025-05-07T20:10:51.8115850Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8116135Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8116400Z libcudart.so.12 => not found 2025-05-07T20:10:51.8116676Z libtorch.so => not found 2025-05-07T20:10:51.8116918Z libc10.so => not found 2025-05-07T20:10:51.8117169Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8117445Z libc10_cuda.so => not found 2025-05-07T20:10:51.8117701Z libnccl.so.2 => not found 2025-05-07T20:10:51.8117960Z libcuda.so.1 => not found 2025-05-07T20:10:51.8118385Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8118664Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8118923Z libtorch_cuda.so => not found 
2025-05-07T20:10:51.8119281Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f7a2f66e000) 2025-05-07T20:10:51.8119662Z libc10.so => not found 2025-05-07T20:10:51.8119913Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8120173Z libc10_cuda.so => not found 2025-05-07T20:10:51.8120438Z libnccl.so.2 => not found 2025-05-07T20:10:51.8120693Z libcuda.so.1 => not found 2025-05-07T20:10:51.8121205Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f7a2eb33000) 2025-05-07T20:10:51.8121778Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8122047Z libtorch.so => not found 2025-05-07T20:10:51.8122314Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8122578Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8122845Z libtorch.so => not found 2025-05-07T20:10:51.8123091Z libc10.so => not found 2025-05-07T20:10:51.8123339Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8123594Z libc10_cuda.so => not found 2025-05-07T20:10:51.8123857Z libnccl.so.2 => not found 2025-05-07T20:10:51.8124118Z libcuda.so.1 => not found 2025-05-07T20:10:51.8124369Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8124649Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8124908Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8125185Z libcudart.so.12 => not found 2025-05-07T20:10:51.8125445Z libtorch.so => not found 2025-05-07T20:10:51.8125705Z libc10.so => not found 2025-05-07T20:10:51.8125935Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.8126210Z libc10_cuda.so => not found 2025-05-07T20:10:51.8126306Z libnccl.so.2 => not found 2025-05-07T20:10:51.8126396Z libcuda.so.1 => not found 2025-05-07T20:10:51.8126508Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.8126637Z libtorch_cpu.so => not found 2025-05-07T20:10:51.8126732Z libtorch_cuda.so => not found 2025-05-07T20:10:51.8126999Z librt.so.1 => /lib64/librt.so.1 (0x00007f7a2f65f000) 2025-05-07T20:10:51.8127006Z 2025-05-07T20:10:51.8127111Z [CHECK] Displaying ELF information: 
2025-05-07T20:10:51.8127390Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:51.8127395Z 2025-05-07T20:10:51.8127400Z 2025-05-07T20:10:51.8127567Z Dynamic section at offset 0xa44058 contains 42 entries: 2025-05-07T20:10:51.8127682Z Tag Type Name/Value 2025-05-07T20:10:51.8127906Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.8128120Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.8128312Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.8129007Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.8129211Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.8129511Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:51.8129751Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:51.8129997Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:51.8130208Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.8130399Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.8130624Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:51.8130821Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.8131022Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.8131239Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.8131432Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.8131659Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.8131885Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:51.8132159Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 
2025-05-07T20:10:51.8132339Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:51.8132470Z 0x000000000000000c (INIT) 0x190000 2025-05-07T20:10:51.8132587Z 0x000000000000000d (FINI) 0x8ac368 2025-05-07T20:10:51.8132700Z 0x0000000000000019 (INIT_ARRAY) 0xa37c40 2025-05-07T20:10:51.8132827Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:10:51.8132949Z 0x000000000000001a (FINI_ARRAY) 0xa37d40 2025-05-07T20:10:51.8133067Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.8133181Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.8133311Z 0x0000000000000005 (STRTAB) 0x23008 2025-05-07T20:10:51.8133416Z 0x0000000000000006 (SYMTAB) 0x93e8 2025-05-07T20:10:51.8133552Z 0x000000000000000a (STRSZ) 1248185 (bytes) 2025-05-07T20:10:51.8133679Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.8133793Z 0x0000000000000003 (PLTGOT) 0xa47fe8 2025-05-07T20:10:51.8133929Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:10:51.8134042Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.8134169Z 0x0000000000000017 (JMPREL) 0x184d90 2025-05-07T20:10:51.8134278Z 0x0000000000000007 (RELA) 0x155f30 2025-05-07T20:10:51.8134412Z 0x0000000000000008 (RELASZ) 192096 (bytes) 2025-05-07T20:10:51.8134540Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.8134691Z 0x000000006ffffffe (VERNEED) 0x155e20 2025-05-07T20:10:51.8134801Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:51.8134935Z 0x000000006ffffff0 (VERSYM) 0x153bc2 2025-05-07T20:10:51.8135039Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:10:51.8135139Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.8135144Z 2025-05-07T20:10:51.8135261Z ################################################################################ 2025-05-07T20:10:51.8135265Z 2025-05-07T20:10:51.8135283Z 2025-05-07T20:10:51.8135392Z ################################################################################ 2025-05-07T20:10:51.8135705Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.8135811Z [CHECK] Listing out library size: 2025-05-07T20:10:51.8136096Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.8136104Z 2025-05-07T20:10:51.8136308Z 429 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.8136312Z 2025-05-07T20:10:51.8136734Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.8137239Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.8137244Z 2025-05-07T20:10:51.8483696Z GLIBC_2.2.5 2025-05-07T20:10:51.8483946Z GLIBC_2.14 2025-05-07T20:10:51.8484185Z 2025-05-07T20:10:51.8484566Z 2025-05-07T20:10:51.8485559Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.8486138Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.8486150Z 2025-05-07T20:10:51.8881888Z GLIBCXX_3.4 2025-05-07T20:10:51.8882796Z GLIBCXX_3.4.9 2025-05-07T20:10:51.8883059Z GLIBCXX_3.4.11 2025-05-07T20:10:51.8883628Z GLIBCXX_3.4.14 2025-05-07T20:10:51.8883892Z GLIBCXX_3.4.18 2025-05-07T20:10:51.8884119Z GLIBCXX_3.4.20 2025-05-07T20:10:51.8884350Z GLIBCXX_3.4.21 2025-05-07T20:10:51.8884388Z 2025-05-07T20:10:51.8884400Z 2025-05-07T20:10:51.8905639Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.lZQttKPNnb.symbols.txt 2025-05-07T20:10:51.8905662Z 2025-05-07T20:10:51.9262418Z 2025-05-07T20:10:51.9289025Z [CHECK] Total Number of symbols: 5083 2025-05-07T20:10:51.9316376Z [CHECK] Number of fbgemm symbols: 3788 
2025-05-07T20:10:51.9331053Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.SAlxcBnZ21.usymbols.txt 2025-05-07T20:10:51.9331080Z 2025-05-07T20:10:51.9362906Z 2025-05-07T20:10:51.9391341Z [CHECK] Listing out undefined symbols (246 total): 2025-05-07T20:10:51.9404518Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.9405388Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.9405619Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:51.9405787Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.9406076Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:51.9406256Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.9406407Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:51.9406558Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:51.9406679Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:51.9406846Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:51.9406986Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:51.9407333Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:51.9407466Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:51.9407589Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:51.9407704Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:51.9407827Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:51.9407940Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:51.9408051Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:51.9408156Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:51.9408269Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:51.9408443Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:51.9408648Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:51.9409222Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9409867Z U 
at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9410537Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9410709Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:51.9411199Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9411390Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:51.9411705Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:51.9412497Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:51.9412977Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9413091Z U at::detail::getCUDAHooks() 2025-05-07T20:10:51.9413200Z U at::detail::getHIPHooks() 2025-05-07T20:10:51.9413313Z U at::get_thread_num() 2025-05-07T20:10:51.9413416Z U at::globalContext() 2025-05-07T20:10:51.9413534Z U at::internal::set_thread_num(int) 2025-05-07T20:10:51.9413831Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:51.9413999Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9414215Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9414346Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:10:51.9414687Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:10:51.9414855Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:51.9415466Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, 
c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.9415811Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:51.9415958Z U c10::Error::what() const 2025-05-07T20:10:51.9416059Z U c10::GradMode::is_enabled() 2025-05-07T20:10:51.9416167Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:51.9416325Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9416492Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9416635Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:51.9416762Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:10:51.9416975Z U c10::IValue::isTensorList() const 2025-05-07T20:10:51.9417108Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:51.9417217Z U c10::IntType::get() 2025-05-07T20:10:51.9417669Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.9417859Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:51.9417986Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:51.9418080Z U c10::NoneType::get() 2025-05-07T20:10:51.9418291Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.9418424Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:51.9418542Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:51.9418695Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:51.9418800Z U c10::StringType::get() 2025-05-07T20:10:51.9418934Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:51.9419072Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:51.9419460Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:51.9419640Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:51.9419752Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:51.9419903Z U 
c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:51.9420315Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:51.9420413Z U c10::TensorType::get() 2025-05-07T20:10:51.9421157Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:51.9421280Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:51.9421941Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:51.9422160Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:10:51.9422287Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:51.9422399Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:51.9422528Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:51.9422635Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:51.9422747Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:51.9422867Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:51.9423101Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:51.9423217Z U c10::cuda::device_count() 2025-05-07T20:10:51.9423365Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:51.9423493Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:51.9423626Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:51.9423817Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:51.9423965Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:51.9424083Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:51.9424504Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:51.9424981Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.9425970Z U 
c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:51.9426208Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:51.9426677Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.9426993Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:51.9427533Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:51.9427726Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:10:51.9427823Z U c10::get_default_dtype() 2025-05-07T20:10:51.9428083Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:51.9428281Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:51.9428741Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:51.9428849Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:51.9429154Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:51.9429529Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:51.9429692Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:10:51.9429846Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:10:51.9430089Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:10:51.9430233Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:51.9430382Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:51.9430547Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:10:51.9430653Z U c10::warn(c10::Warning const&) 
2025-05-07T20:10:51.9430861Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:51.9430982Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:51.9431113Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:51.9431260Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:51.9431376Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:51.9431568Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:51.9431695Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:51.9431810Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:51.9431932Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:51.9432052Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:51.9432180Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:51.9432291Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:51.9432404Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:51.9432580Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:51.9432708Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:51.9433475Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9434472Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9435323Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9436114Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9436957Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9437865Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9438579Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:51.9439382Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:51.9440221Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9441165Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:51.9442000Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9442734Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:51.9443579Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:51.9444412Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9445134Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:51.9445897Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, 
bool, bool, bool, int) 2025-05-07T20:10:51.9446703Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9447532Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:51.9448366Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9449121Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:51.9449983Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:51.9450850Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9451627Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9452476Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9453320Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9454144Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9455191Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9456155Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:51.9456298Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9456452Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9456619Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9456761Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9457346Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:10:51.9457535Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:51.9457701Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9457853Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9458472Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:10:51.9458895Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:10:51.9458993Z U memchr@GLIBC_2.2.5 2025-05-07T20:10:51.9459103Z U memcpy@GLIBC_2.14 2025-05-07T20:10:51.9459200Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:51.9459294Z U memset@GLIBC_2.2.5 2025-05-07T20:10:51.9459412Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:51.9459548Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:51.9459796Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:51.9460139Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:51.9460539Z U std::__cxx11::basic_ostringstream, std::allocator 
>::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.9460871Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:51.9461610Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:10:51.9461990Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:51.9462362Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:51.9462524Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:51.9462672Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:51.9462789Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:51.9462924Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:51.9463064Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:51.9463203Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.9463354Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:51.9463563Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:51.9463698Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:51.9463956Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:51.9464529Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.9464715Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:51.9465043Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:10:51.9465224Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:51.9465344Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:10:51.9465595Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:51.9465709Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:51.9465854Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:51.9465973Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:51.9466147Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:10:51.9466258Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:51.9466655Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:51.9466779Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:10:51.9466953Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.9467186Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:51.9467306Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:51.9467459Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:51.9467628Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:51.9467838Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:51.9468004Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:51.9468145Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:51.9468246Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:51.9468337Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:51.9468426Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:51.9468553Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:51.9469101Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:51.9469549Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.9469787Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:51.9470769Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, 
std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:10:51.9471114Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:10:51.9471478Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:51.9471859Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:10:51.9471997Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:10:51.9472287Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:10:51.9472742Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:10:51.9473041Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:10:51.9473199Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:10:51.9473661Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:10:51.9473932Z U typeinfo for c10::Error 2025-05-07T20:10:51.9474056Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:51.9474280Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:10:51.9474406Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:51.9474711Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:51.9474920Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:51.9475226Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:51.9475383Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:51.9475563Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:51.9475724Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:51.9475827Z U vtable for c10::Error 
2025-05-07T20:10:51.9476204Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:51.9476436Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:51.9476567Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:51.9476683Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:51.9476806Z w _ITM_registerTMCloneTable 2025-05-07T20:10:51.9476913Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:51.9477008Z w __gmon_start__ 2025-05-07T20:10:51.9477124Z w __pthread_key_create 2025-05-07T20:10:51.9477232Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:51.9477348Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:51.9477513Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:51.9477732Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.9477740Z 2025-05-07T20:10:51.9477887Z linux-vdso.so.1 (0x00007ffdaf3bc000) 2025-05-07T20:10:51.9477993Z libc10.so => not found 2025-05-07T20:10:51.9478093Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.9478186Z libc10_cuda.so => not found 2025-05-07T20:10:51.9478296Z libnccl.so.2 => not found 2025-05-07T20:10:51.9478393Z libcuda.so.1 => not found 2025-05-07T20:10:51.9478767Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f01eea00000) 2025-05-07T20:10:51.9479232Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f01ed200000) 2025-05-07T20:10:51.9479689Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f020a2ed000) 2025-05-07T20:10:51.9479820Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.9479919Z libtorch.so => not found 2025-05-07T20:10:51.9480032Z libtorch_cpu.so => not found 2025-05-07T20:10:51.9480135Z libtorch_cuda.so => not found 2025-05-07T20:10:51.9480243Z libcudart.so.12 => not found 
2025-05-07T20:10:51.9480438Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f01ecf9c000) 2025-05-07T20:10:51.9480601Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f020a2bd000) 2025-05-07T20:10:51.9480851Z libc.so.6 => /lib64/libc.so.6 (0x00007f01ecd94000) 2025-05-07T20:10:51.9480944Z libc10.so => not found 2025-05-07T20:10:51.9481063Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.9481177Z libc10_cuda.so => not found 2025-05-07T20:10:51.9481270Z libnccl.so.2 => not found 2025-05-07T20:10:51.9481380Z libcuda.so.1 => not found 2025-05-07T20:10:51.9481719Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f020a244000) 2025-05-07T20:10:51.9481826Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.9481923Z libtorch.so => not found 2025-05-07T20:10:51.9482040Z libtorch_cpu.so => not found 2025-05-07T20:10:51.9482165Z libtorch_cuda.so => not found 2025-05-07T20:10:51.9482287Z libm.so.6 => /lib64/libm.so.6 (0x00007f020a167000) 2025-05-07T20:10:51.9482427Z /lib64/ld-linux-x86-64.so.2 (0x00007f020a2fe000) 2025-05-07T20:10:51.9482519Z libtorch.so => not found 2025-05-07T20:10:51.9482607Z libc10.so => not found 2025-05-07T20:10:51.9482703Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.9482813Z libc10_cuda.so => not found 2025-05-07T20:10:51.9482904Z libnccl.so.2 => not found 2025-05-07T20:10:51.9482998Z libcuda.so.1 => not found 2025-05-07T20:10:51.9483114Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.9483209Z libtorch_cpu.so => not found 2025-05-07T20:10:51.9483303Z libtorch_cuda.so => not found 2025-05-07T20:10:51.9483396Z libcudart.so.12 => not found 2025-05-07T20:10:51.9483558Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f01eefaa000) 2025-05-07T20:10:51.9483650Z libtorch.so => not found 2025-05-07T20:10:51.9483737Z libc10.so => not found 2025-05-07T20:10:51.9483895Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.9483983Z libc10_cuda.so => not found 2025-05-07T20:10:51.9484075Z libnccl.so.2 => not found 
2025-05-07T20:10:51.9484167Z libcuda.so.1 => not found 2025-05-07T20:10:51.9484279Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.9484373Z libtorch_cpu.so => not found 2025-05-07T20:10:51.9484470Z libtorch_cuda.so => not found 2025-05-07T20:10:51.9484656Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f020a15a000) 2025-05-07T20:10:51.9484747Z libtorch.so => not found 2025-05-07T20:10:51.9484835Z libc10.so => not found 2025-05-07T20:10:51.9484930Z libnvrtc.so.12 => not found 2025-05-07T20:10:51.9485039Z libc10_cuda.so => not found 2025-05-07T20:10:51.9485136Z libnccl.so.2 => not found 2025-05-07T20:10:51.9485228Z libcuda.so.1 => not found 2025-05-07T20:10:51.9485345Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:51.9485439Z libtorch_cpu.so => not found 2025-05-07T20:10:51.9485534Z libtorch_cuda.so => not found 2025-05-07T20:10:51.9485689Z librt.so.1 => /lib64/librt.so.1 (0x00007f020a151000) 2025-05-07T20:10:51.9485694Z 2025-05-07T20:10:51.9485799Z [CHECK] Displaying ELF information: 2025-05-07T20:10:51.9486027Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:51.9486033Z 2025-05-07T20:10:51.9501612Z 2025-05-07T20:10:51.9502381Z Dynamic section at offset 0x1ac7bfc8 contains 41 entries: 2025-05-07T20:10:51.9502823Z Tag Type Name/Value 2025-05-07T20:10:51.9503430Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:51.9504064Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:51.9504644Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:51.9505224Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:51.9506044Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:51.9506613Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:51.9507257Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:51.9507886Z 0x0000000000000001 (NEEDED) 
Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:51.9508527Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:51.9509073Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:51.9509322Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:51.9509553Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:51.9509760Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:51.9509966Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:51.9510184Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:51.9510416Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:51.9510654Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:10:51.9510860Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:51.9510985Z 0x000000000000000c (INIT) 0x1a0000 2025-05-07T20:10:51.9511106Z 0x000000000000000d (FINI) 0x74838c 2025-05-07T20:10:51.9511234Z 0x0000000000000019 (INIT_ARRAY) 0x1ac7aca0 2025-05-07T20:10:51.9511385Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:10:51.9511514Z 0x000000000000001a (FINI_ARRAY) 0x1ac7ae28 2025-05-07T20:10:51.9511642Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:51.9511779Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:51.9511900Z 0x0000000000000005 (STRTAB) 0x27a50 2025-05-07T20:10:51.9512017Z 0x0000000000000006 (SYMTAB) 0x9db0 2025-05-07T20:10:51.9512222Z 0x000000000000000a (STRSZ) 1387089 (bytes) 2025-05-07T20:10:51.9512348Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:51.9512477Z 0x0000000000000003 (PLTGOT) 0x1ac84fe8 2025-05-07T20:10:51.9512618Z 0x0000000000000002 (PLTRELSZ) 20568 (bytes) 2025-05-07T20:10:51.9512756Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:51.9512880Z 0x0000000000000017 (JMPREL) 0x19af18 2025-05-07T20:10:51.9513002Z 
0x0000000000000007 (RELA) 0x17cd80 2025-05-07T20:10:51.9513164Z 0x0000000000000008 (RELASZ) 123288 (bytes) 2025-05-07T20:10:51.9513293Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:51.9513418Z 0x000000006ffffffe (VERNEED) 0x17cc60 2025-05-07T20:10:51.9513555Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:51.9513681Z 0x000000006ffffff0 (VERSYM) 0x17a4a2 2025-05-07T20:10:51.9513798Z 0x000000006ffffff9 (RELACOUNT) 539 2025-05-07T20:10:51.9513908Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:51.9513913Z 2025-05-07T20:10:51.9514050Z ################################################################################ 2025-05-07T20:10:51.9514055Z 2025-05-07T20:10:51.9514059Z 2025-05-07T20:10:51.9514351Z ################################################################################ 2025-05-07T20:10:51.9514718Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:51.9514851Z [CHECK] Listing out library size: 2025-05-07T20:10:51.9515207Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:51.9515212Z 2025-05-07T20:10:51.9515509Z 5 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:51.9515582Z 2025-05-07T20:10:51.9516061Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:51.9516649Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.9516655Z 2025-05-07T20:10:51.9781297Z GLIBC_2.2.5 2025-05-07T20:10:51.9781526Z GLIBC_2.3 2025-05-07T20:10:51.9781657Z GLIBC_2.14 2025-05-07T20:10:51.9781666Z 2025-05-07T20:10:51.9781670Z 2025-05-07T20:10:51.9782496Z [CHECK] Listing out the GLIBCXX versions referenced by: 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:51.9783234Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:51.9783245Z 2025-05-07T20:10:52.0045045Z GLIBCXX_3.4 2025-05-07T20:10:52.0045320Z GLIBCXX_3.4.9 2025-05-07T20:10:52.0045601Z GLIBCXX_3.4.11 2025-05-07T20:10:52.0045849Z GLIBCXX_3.4.15 2025-05-07T20:10:52.0046414Z GLIBCXX_3.4.18 2025-05-07T20:10:52.0046664Z GLIBCXX_3.4.20 2025-05-07T20:10:52.0046904Z GLIBCXX_3.4.21 2025-05-07T20:10:52.0046944Z 2025-05-07T20:10:52.0046957Z 2025-05-07T20:10:52.0070228Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.s2aXA03kkW.symbols.txt 2025-05-07T20:10:52.0070274Z 2025-05-07T20:10:52.0288862Z 2025-05-07T20:10:52.0316796Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:10:52.0335355Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:10:52.0355834Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.ckVX2hgatv.usymbols.txt 2025-05-07T20:10:52.0355868Z 2025-05-07T20:10:52.0379270Z 2025-05-07T20:10:52.0403873Z [CHECK] Listing out undefined symbols (189 total): 2025-05-07T20:10:52.0421078Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.0421822Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.0421949Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:52.0422105Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:52.0422402Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:52.0422542Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:52.0422689Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:52.0422806Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:52.0422921Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:52.0423052Z 
U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:52.0423158Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:52.0423273Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:52.0423380Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:52.0423515Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:52.0423621Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:52.0423742Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:52.0423946Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:52.0424081Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:52.0424199Z U at::RecordFunction::end() 2025-05-07T20:10:52.0424348Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:52.0424506Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:52.0425259Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.0425680Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:52.0426381Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.0427081Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.0427258Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:52.0427443Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:52.0427651Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.0427800Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:52.0429531Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.0429694Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:52.0429865Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 
2025-05-07T20:10:52.0430018Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:52.0430121Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:52.0430233Z U c10::AnyType::get() 2025-05-07T20:10:52.0430354Z U c10::BoolType::get() 2025-05-07T20:10:52.0430539Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:52.0430657Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:52.0431197Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:52.0431965Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:52.0432343Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.0432463Z U c10::Error::what() const 2025-05-07T20:10:52.0432560Z U c10::FloatType::get() 2025-05-07T20:10:52.0432673Z U c10::GradMode::is_enabled() 2025-05-07T20:10:52.0432800Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:52.0432961Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:52.0433077Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:52.0433203Z U c10::IValue::isBoolList() const 2025-05-07T20:10:52.0433319Z U c10::IValue::isIntList() const 2025-05-07T20:10:52.0433441Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:52.0433569Z U c10::IValue::isTensorList() const 2025-05-07T20:10:52.0433717Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:52.0433819Z U c10::IntType::get() 2025-05-07T20:10:52.0434439Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.0434615Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:52.0434740Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:52.0434871Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.0435099Z U 
c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.0435382Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.0435668Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:52.0435791Z U c10::StringType::get() 2025-05-07T20:10:52.0435935Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:52.0436078Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:52.0436269Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:52.0436448Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:52.0436602Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:52.0437020Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:52.0437159Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:52.0437288Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:52.0437469Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:52.0437613Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:52.0437728Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:52.0437876Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:52.0438009Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:52.0438137Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:52.0438264Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:52.0438375Z U c10::SymIntType::get() 2025-05-07T20:10:52.0438500Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:52.0438619Z U c10::TensorType::get() 2025-05-07T20:10:52.0438748Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:52.0439186Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.0439741Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.0439994Z U c10::detail::torchCheckFail(char const*, 
char const*, unsigned int, char const*) 2025-05-07T20:10:52.0440493Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.0440942Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:52.0441484Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.0441793Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:52.0441977Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:52.0442093Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.0442244Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:52.0442599Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.0442716Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:52.0442865Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:52.0443014Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:52.0443175Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:52.0443358Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:52.0443486Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:52.0443726Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:52.0443821Z U free@GLIBC_2.2.5 2025-05-07T20:10:52.0443996Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:52.0444089Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:52.0444204Z U memcpy@GLIBC_2.14 2025-05-07T20:10:52.0444306Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:52.0444397Z U memset@GLIBC_2.2.5 2025-05-07T20:10:52.0444509Z U operator delete(void*)@GLIBCXX_3.4 
2025-05-07T20:10:52.0444632Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.0444733Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:52.0444960Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:52.0445277Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:52.0445649Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.0445954Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:52.0446320Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.0446666Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:52.0446781Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:52.0446936Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:52.0447073Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.0447202Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.0447375Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:52.0447499Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:52.0447634Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:52.0447868Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:52.0448402Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.0448530Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:52.0448659Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:52.0448771Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:52.0448886Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:52.0449000Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:52.0449171Z U std::ostream& 
std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.0449395Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.0449527Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:52.0449681Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:52.0449808Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:52.0450252Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:52.0450387Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:52.0450490Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:52.0450588Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:52.0450691Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:52.0450810Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:52.0451381Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:52.0451823Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.0452063Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.0452219Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:52.0452495Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:52.0452667Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:52.0452866Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:52.0453039Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:52.0453364Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:52.0453515Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:52.0453689Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 
2025-05-07T20:10:52.0453856Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:52.0454010Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:52.0454117Z U torch::autograd::Node::metadata() 2025-05-07T20:10:52.0454245Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:52.0454476Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:52.0454734Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:52.0454866Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:52.0455065Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:52.0455277Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:52.0457757Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:52.0457905Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:52.0458046Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:52.0458229Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:52.0458965Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:52.0459111Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:52.0459498Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:52.0459854Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.0459966Z U typeinfo for c10::Error 2025-05-07T20:10:52.0460096Z U typeinfo for 
std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.0460218Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:52.0460337Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:52.0460512Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:52.0460624Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:52.0460764Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:52.0460933Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:52.0461080Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:52.0461225Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.0461335Z U vtable for c10::Error 2025-05-07T20:10:52.0461642Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.0461766Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.0461996Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:52.0462131Z U vtable for torch::autograd::Node 2025-05-07T20:10:52.0462299Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.0462412Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:52.0462514Z w _ITM_registerTMCloneTable 2025-05-07T20:10:52.0462611Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:52.0462712Z w __gmon_start__ 2025-05-07T20:10:52.0462806Z w __pthread_key_create 2025-05-07T20:10:52.0462910Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:52.0463021Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:52.0463162Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:52.0463431Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:52.0463441Z 2025-05-07T20:10:52.0475242Z linux-vdso.so.1 (0x00007ffef5bd0000) 2025-05-07T20:10:52.0475452Z libc10.so => not found 2025-05-07T20:10:52.0476232Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.0476335Z libc10_cuda.so => not found 
2025-05-07T20:10:52.0476428Z libnccl.so.2 => not found 2025-05-07T20:10:52.0476538Z libcuda.so.1 => not found 2025-05-07T20:10:52.0477019Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f0441453000) 2025-05-07T20:10:52.0477497Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f043fc00000) 2025-05-07T20:10:52.0477608Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.0477698Z libtorch.so => not found 2025-05-07T20:10:52.0477791Z libtorch_cpu.so => not found 2025-05-07T20:10:52.0477897Z libtorch_cuda.so => not found 2025-05-07T20:10:52.0478196Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f043f99c000) 2025-05-07T20:10:52.0478341Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0441423000) 2025-05-07T20:10:52.0478473Z libc.so.6 => /lib64/libc.so.6 (0x00007f043f794000) 2025-05-07T20:10:52.0478610Z /lib64/ld-linux-x86-64.so.2 (0x00007f0441464000) 2025-05-07T20:10:52.0478701Z libtorch.so => not found 2025-05-07T20:10:52.0478784Z libc10.so => not found 2025-05-07T20:10:52.0478888Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.0478980Z libc10_cuda.so => not found 2025-05-07T20:10:52.0479069Z libnccl.so.2 => not found 2025-05-07T20:10:52.0479168Z libcuda.so.1 => not found 2025-05-07T20:10:52.0479310Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.0479405Z libtorch_cpu.so => not found 2025-05-07T20:10:52.0479496Z libtorch_cuda.so => not found 2025-05-07T20:10:52.0479658Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f04413c9000) 2025-05-07T20:10:52.0479832Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f04413c4000) 2025-05-07T20:10:52.0479925Z libtorch.so => not found 2025-05-07T20:10:52.0480020Z libc10.so => not found 2025-05-07T20:10:52.0480116Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.0480244Z libc10_cuda.so => not found 2025-05-07T20:10:52.0480340Z libnccl.so.2 => not found 
2025-05-07T20:10:52.0480437Z libcuda.so.1 => not found 2025-05-07T20:10:52.0480536Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.0480634Z libtorch_cpu.so => not found 2025-05-07T20:10:52.0480736Z libtorch_cuda.so => not found 2025-05-07T20:10:52.0480841Z libcudart.so.12 => not found 2025-05-07T20:10:52.0480964Z libm.so.6 => /lib64/libm.so.6 (0x00007f04412e5000) 2025-05-07T20:10:52.0480971Z 2025-05-07T20:10:52.0481080Z [CHECK] Displaying ELF information: 2025-05-07T20:10:52.0481411Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:52.0481416Z 2025-05-07T20:10:52.0511700Z 2025-05-07T20:10:52.0512275Z Dynamic section at offset 0x4b5fc8 contains 40 entries: 2025-05-07T20:10:52.0512447Z Tag Type Name/Value 2025-05-07T20:10:52.0512660Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:52.0513030Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:52.0513276Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:52.0513474Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:52.0513667Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:52.0513893Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:52.0514278Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:52.0514494Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:52.0514707Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:52.0514908Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:52.0515189Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:52.0515390Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:52.0515594Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:52.0515781Z 
0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:52.0515990Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:52.0516312Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:10:52.0516493Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:52.0516605Z 0x000000000000000c (INIT) 0xd6000 2025-05-07T20:10:52.0516736Z 0x000000000000000d (FINI) 0x3f64b8 2025-05-07T20:10:52.0516913Z 0x0000000000000019 (INIT_ARRAY) 0x4add80 2025-05-07T20:10:52.0517038Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:10:52.0517155Z 0x000000000000001a (FINI_ARRAY) 0x4adeb0 2025-05-07T20:10:52.0517290Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:52.0517397Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:52.0517508Z 0x0000000000000005 (STRTAB) 0x16e00 2025-05-07T20:10:52.0517627Z 0x0000000000000006 (SYMTAB) 0x55e0 2025-05-07T20:10:52.0517759Z 0x000000000000000a (STRSZ) 609767 (bytes) 2025-05-07T20:10:52.0517878Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:52.0518047Z 0x0000000000000003 (PLTGOT) 0x4b8fe8 2025-05-07T20:10:52.0518300Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:10:52.0518404Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:52.0518514Z 0x0000000000000017 (JMPREL) 0xcdaf0 2025-05-07T20:10:52.0518631Z 0x0000000000000007 (RELA) 0xad450 2025-05-07T20:10:52.0518758Z 0x0000000000000008 (RELASZ) 132768 (bytes) 2025-05-07T20:10:52.0518916Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:52.0519044Z 0x000000006ffffffe (VERNEED) 0xad340 2025-05-07T20:10:52.0519146Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:52.0519256Z 0x000000006ffffff0 (VERSYM) 0xabbe8 2025-05-07T20:10:52.0519355Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:10:52.0519468Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:52.0519474Z 2025-05-07T20:10:52.0519582Z 
################################################################################ 2025-05-07T20:10:52.0519587Z 2025-05-07T20:10:52.0519591Z 2025-05-07T20:10:52.0519704Z ################################################################################ 2025-05-07T20:10:52.0520000Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.0520101Z [CHECK] Listing out library size: 2025-05-07T20:10:52.0520397Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.0520445Z 2025-05-07T20:10:52.0524273Z 339 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.0525266Z 2025-05-07T20:10:52.0526061Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.0526605Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.0526611Z 2025-05-07T20:10:52.1499301Z GLIBC_2.2.5 2025-05-07T20:10:52.1499713Z GLIBC_2.3 2025-05-07T20:10:52.1499913Z GLIBC_2.14 2025-05-07T20:10:52.1500024Z 2025-05-07T20:10:52.1500224Z 2025-05-07T20:10:52.1500958Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.1502145Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.1502804Z 2025-05-07T20:10:52.2485045Z GLIBCXX_3.4 2025-05-07T20:10:52.2485683Z GLIBCXX_3.4.9 2025-05-07T20:10:52.2486292Z GLIBCXX_3.4.20 2025-05-07T20:10:52.2486878Z GLIBCXX_3.4.21 2025-05-07T20:10:52.2487251Z 2025-05-07T20:10:52.2487265Z 2025-05-07T20:10:52.2509784Z + nm -gDC 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.3uUlN9gEoy.symbols.txt 2025-05-07T20:10:52.2511337Z 2025-05-07T20:10:52.3436923Z 2025-05-07T20:10:52.3478162Z [CHECK] Total Number of symbols: 12626 2025-05-07T20:10:52.3528194Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:10:52.3546256Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.kDUTU6DH8c.usymbols.txt 2025-05-07T20:10:52.3548179Z 2025-05-07T20:10:52.3593155Z 2025-05-07T20:10:52.3620217Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:52.3643599Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.3644264Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:52.3644632Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.3645020Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.3645415Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.3645969Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:52.3646346Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:52.3646711Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:52.3647063Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.3647432Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:52.3647749Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:52.3648125Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:52.3648434Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:52.3648754Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:52.3649082Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:52.3649386Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:52.3649711Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:52.3650012Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:52.3650313Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:52.3650614Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:52.3650929Z U __tls_get_addr@GLIBC_2.3 
2025-05-07T20:10:52.3651311Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:52.3651720Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:52.3652257Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:52.3652998Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:52.3653631Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:52.3654235Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:52.3655269Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.3656339Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:52.3656926Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.3657358Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:52.3657787Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:52.3658188Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.3658644Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.3659035Z U c10::BoolType::get() 2025-05-07T20:10:52.3659357Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:52.3659778Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:52.3660148Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:52.3660850Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:52.3662088Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:52.3663115Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 
2025-05-07T20:10:52.3663671Z U c10::Error::what() const 2025-05-07T20:10:52.3663957Z U c10::FloatType::get() 2025-05-07T20:10:52.3664288Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.3664732Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.3665126Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:52.3665470Z U c10::IntType::get() 2025-05-07T20:10:52.3665810Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:52.3666195Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:52.3666543Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.3666887Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.3667246Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:52.3667601Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:52.3667984Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:52.3668593Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:52.3669199Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:52.3669562Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:10:52.3669903Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:52.3670261Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:52.3670619Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:52.3670980Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:10:52.3671338Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:52.3671670Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:52.3672019Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:52.3672333Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:52.3672653Z U c10::SymIntType::get() 2025-05-07T20:10:52.3672933Z U c10::TensorType::get() 2025-05-07T20:10:52.3673242Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:52.3674278Z U c10::Warning::Warning(std::variant, 
c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:52.3675450Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:52.3675823Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:52.3676164Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:52.3676519Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:52.3676876Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:52.3677215Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:52.3677692Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:52.3678157Z U c10::cuda::device_count() 2025-05-07T20:10:52.3678508Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:52.3678912Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:52.3679396Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:52.3679810Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:52.3680223Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:52.3680624Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:52.3681356Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.3682276Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:52.3683179Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.3684134Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:52.3685200Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.3686022Z U c10::get_default_dtype() 2025-05-07T20:10:52.3686369Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:52.3686708Z U c10::impl::GPUTrace::haveState 
2025-05-07T20:10:52.3687359Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:52.3687933Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:52.3688342Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:52.3688684Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.3689047Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:52.3689420Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:10:52.3689769Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:10:52.3690114Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:52.3690438Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:52.3690797Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:52.3691172Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:52.3691546Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:52.3691936Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:52.3692259Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:52.3692639Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:52.3693045Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:52.3693381Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:52.3693722Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:52.3694042Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:52.3694363Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:52.3694671Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:52.3695000Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:52.3695332Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:52.3695645Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:52.3695966Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:52.3696279Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:52.3696796Z U cudaStreamSynchronize@libcudart.so.12 
2025-05-07T20:10:52.3697142Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:52.3697680Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.3698194Z U float at::Tensor::item() const 2025-05-07T20:10:52.3698550Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.3698949Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.3699484Z U free@GLIBC_2.2.5 2025-05-07T20:10:52.3699805Z U int at::Tensor::item() const 2025-05-07T20:10:52.3700172Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.3700641Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.3701076Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:52.3701490Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.3701886Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.3702237Z U memcpy@GLIBC_2.14 2025-05-07T20:10:52.3702531Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:52.3702838Z U memset@GLIBC_2.2.5 2025-05-07T20:10:52.3703140Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:52.3703479Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.3704047Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:52.3704886Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.3705488Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:52.3705847Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.3706249Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.3706664Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:52.3707225Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:52.3708144Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.3708961Z U 
std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:52.3709330Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:52.3709680Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:52.3710034Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:52.3710363Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:52.3710776Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.3711308Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.3711771Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:52.3712121Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:52.3712430Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:52.3712762Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:52.3713590Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:52.3714852Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.3715687Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.3716455Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.3717026Z U typeinfo for c10::Error 2025-05-07T20:10:52.3717392Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:52.3717814Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:52.3718244Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.3718619Z U vtable for c10::Error 2025-05-07T20:10:52.3719165Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.3719829Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:52.3720332Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.3720733Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:52.3721047Z w _ITM_registerTMCloneTable 2025-05-07T20:10:52.3721365Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:52.3721700Z w __gmon_start__ 2025-05-07T20:10:52.3721969Z w __pthread_key_create 2025-05-07T20:10:52.3722312Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:52.3722789Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.3723148Z 2025-05-07T20:10:52.3723285Z linux-vdso.so.1 (0x00007ffde8bab000) 2025-05-07T20:10:52.3723590Z libc10.so => not found 2025-05-07T20:10:52.3723834Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.3724102Z libc10_cuda.so => not found 2025-05-07T20:10:52.3724352Z libnccl.so.2 => not found 2025-05-07T20:10:52.3724613Z libcuda.so.1 => not found 2025-05-07T20:10:52.3725244Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f653f600000) 2025-05-07T20:10:52.3725934Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.3726225Z libtorch.so => not found 2025-05-07T20:10:52.3726489Z libtorch_cpu.so => not found 2025-05-07T20:10:52.3726878Z libtorch_cuda.so => not found 2025-05-07T20:10:52.3727119Z libcudart.so.12 => not found 2025-05-07T20:10:52.3727436Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f653f39c000) 2025-05-07T20:10:52.3727818Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f653f9d2000) 2025-05-07T20:10:52.3728174Z libc.so.6 => /lib64/libc.so.6 (0x00007f653f194000) 2025-05-07T20:10:52.3728861Z /lib64/ld-linux-x86-64.so.2 (0x00007f655551c000) 2025-05-07T20:10:52.3729194Z libc10.so => not found 2025-05-07T20:10:52.3729430Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.3729703Z libc10_cuda.so => not found 2025-05-07T20:10:52.3729976Z libnccl.so.2 => not found 2025-05-07T20:10:52.3730229Z libcuda.so.1 => not found 2025-05-07T20:10:52.3730761Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f653ec00000) 
2025-05-07T20:10:52.3731667Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f6555507000) 2025-05-07T20:10:52.3732321Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.3732591Z libtorch.so => not found 2025-05-07T20:10:52.3732848Z libtorch_cpu.so => not found 2025-05-07T20:10:52.3733110Z libtorch_cuda.so => not found 2025-05-07T20:10:52.3733377Z libcudart.so.12 => not found 2025-05-07T20:10:52.3733698Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f653f97c000) 2025-05-07T20:10:52.3734077Z libm.so.6 => /lib64/libm.so.6 (0x00007f653eb25000) 2025-05-07T20:10:52.3734408Z libc10.so => not found 2025-05-07T20:10:52.3734651Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.3734917Z libc10_cuda.so => not found 2025-05-07T20:10:52.3735167Z libnccl.so.2 => not found 2025-05-07T20:10:52.3735426Z libcuda.so.1 => not found 2025-05-07T20:10:52.3736009Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f653eaae000) 2025-05-07T20:10:52.3736593Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.3736876Z libtorch.so => not found 2025-05-07T20:10:52.3737116Z libtorch_cpu.so => not found 2025-05-07T20:10:52.3737393Z libtorch_cuda.so => not found 2025-05-07T20:10:52.3737646Z libtorch.so => not found 2025-05-07T20:10:52.3737897Z libc10.so => not found 2025-05-07T20:10:52.3738127Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.3738397Z libc10_cuda.so => not found 2025-05-07T20:10:52.3738645Z libnccl.so.2 => not found 2025-05-07T20:10:52.3738896Z libcuda.so.1 => not found 2025-05-07T20:10:52.3739190Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.3739465Z libtorch_cpu.so => not found 2025-05-07T20:10:52.3739741Z libtorch_cuda.so => not found 2025-05-07T20:10:52.3740077Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f653f975000) 2025-05-07T20:10:52.3740465Z libtorch.so => not found 2025-05-07T20:10:52.3740714Z libc10.so => not found 
2025-05-07T20:10:52.3741084Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.3741322Z libc10_cuda.so => not found 2025-05-07T20:10:52.3741601Z libnccl.so.2 => not found 2025-05-07T20:10:52.3741832Z libcuda.so.1 => not found 2025-05-07T20:10:52.3742089Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.3742332Z libtorch_cpu.so => not found 2025-05-07T20:10:52.3742580Z libtorch_cuda.so => not found 2025-05-07T20:10:52.3742868Z librt.so.1 => /lib64/librt.so.1 (0x00007f653f96c000) 2025-05-07T20:10:52.3743088Z 2025-05-07T20:10:52.3743188Z [CHECK] Displaying ELF information: 2025-05-07T20:10:52.3743624Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:52.3743970Z 2025-05-07T20:10:52.3756397Z 2025-05-07T20:10:52.3756934Z Dynamic section at offset 0x15292018 contains 40 entries: 2025-05-07T20:10:52.3758079Z Tag Type Name/Value 2025-05-07T20:10:52.3759342Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:52.3760805Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:52.3762485Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:52.3763553Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:52.3764072Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:52.3764625Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:52.3765174Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:52.3765704Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:52.3766211Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:52.3766746Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:52.3767284Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:52.3767797Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 
2025-05-07T20:10:52.3768330Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:52.3768827Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:52.3769355Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:52.3769940Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:10:52.3770623Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:52.3771345Z 0x000000000000000c (INIT) 0x453000 2025-05-07T20:10:52.3771659Z 0x000000000000000d (FINI) 0x1fe941c 2025-05-07T20:10:52.3772194Z 0x0000000000000019 (INIT_ARRAY) 0x152889a8 2025-05-07T20:10:52.3772586Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:10:52.3772987Z 0x000000000000001a (FINI_ARRAY) 0x15288c98 2025-05-07T20:10:52.3773509Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:52.3773874Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:52.3774200Z 0x0000000000000005 (STRTAB) 0x624b8 2025-05-07T20:10:52.3774545Z 0x0000000000000006 (SYMTAB) 0x184f0 2025-05-07T20:10:52.3774916Z 0x000000000000000a (STRSZ) 3694099 (bytes) 2025-05-07T20:10:52.3775272Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:52.3775634Z 0x0000000000000003 (PLTGOT) 0x152a8fe8 2025-05-07T20:10:52.3775993Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:10:52.3777687Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:52.3778027Z 0x0000000000000017 (JMPREL) 0x44ece0 2025-05-07T20:10:52.3778370Z 0x0000000000000007 (RELA) 0x3ee668 2025-05-07T20:10:52.3778718Z 0x0000000000000008 (RELASZ) 394872 (bytes) 2025-05-07T20:10:52.3779093Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:52.3779460Z 0x000000006ffffffe (VERNEED) 0x3ee578 2025-05-07T20:10:52.3779831Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:52.3780177Z 0x000000006ffffff0 (VERSYM) 0x3e82cc 2025-05-07T20:10:52.3780512Z 0x000000006ffffff9 (RELACOUNT) 1976 2025-05-07T20:10:52.3780834Z 0x0000000000000000 
(NULL) 0x0 2025-05-07T20:10:52.3781034Z 2025-05-07T20:10:52.3781152Z ################################################################################ 2025-05-07T20:10:52.3781396Z 2025-05-07T20:10:52.3781401Z 2025-05-07T20:10:52.3781521Z ################################################################################ 2025-05-07T20:10:52.3782066Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.3782590Z [CHECK] Listing out library size: 2025-05-07T20:10:52.3783099Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.3783509Z 2025-05-07T20:10:52.3783800Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.3784138Z 2025-05-07T20:10:52.3784567Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.3785760Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.3786374Z 2025-05-07T20:10:52.3830469Z GLIBC_2.2.5 2025-05-07T20:10:52.3830736Z GLIBC_2.3 2025-05-07T20:10:52.3830958Z GLIBC_2.14 2025-05-07T20:10:52.3831468Z 2025-05-07T20:10:52.3831647Z 2025-05-07T20:10:52.3832183Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.3833309Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.3833972Z 2025-05-07T20:10:52.3895963Z GLIBCXX_3.4 2025-05-07T20:10:52.3896582Z GLIBCXX_3.4.9 2025-05-07T20:10:52.3897219Z GLIBCXX_3.4.18 2025-05-07T20:10:52.3897799Z GLIBCXX_3.4.20 2025-05-07T20:10:52.3898377Z GLIBCXX_3.4.21 
2025-05-07T20:10:52.3898715Z 2025-05-07T20:10:52.3898728Z 2025-05-07T20:10:52.3917539Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.Y9oC8N2f1v.symbols.txt 2025-05-07T20:10:52.3919091Z 2025-05-07T20:10:52.3945407Z 2025-05-07T20:10:52.3972333Z [CHECK] Total Number of symbols: 357 2025-05-07T20:10:52.3986000Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:10:52.3999926Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.H7O2u6D6Fe.usymbols.txt 2025-05-07T20:10:52.4000479Z 2025-05-07T20:10:52.4020182Z 2025-05-07T20:10:52.4043569Z [CHECK] Listing out undefined symbols (118 total): 2025-05-07T20:10:52.4060970Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4061806Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4062358Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:52.4062727Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.4063161Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.4063741Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.4064143Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:52.4064514Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:52.4064882Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:52.4065355Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.4065707Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:52.4066009Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:52.4066398Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:52.4066721Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:52.4067026Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:52.4067352Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:52.4067657Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:52.4068092Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:52.4068390Z U __gxx_personality_v0@CXXABI_1.3 
2025-05-07T20:10:52.4068702Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:52.4069457Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4070697Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4071681Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:52.4072079Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:52.4072402Z U c10::IntType::get() 2025-05-07T20:10:52.4072757Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:52.4073131Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:52.4073559Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.4074424Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:52.4075343Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:52.4075745Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:52.4076102Z U c10::TensorType::get() 2025-05-07T20:10:52.4076473Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:52.4077447Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:52.4078422Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:52.4078822Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:52.4079213Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:52.4079570Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:52.4079950Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:52.4080438Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:52.4081056Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:52.4081510Z U c10::cuda::device_count() 2025-05-07T20:10:52.4081871Z U c10::cuda::getCurrentCUDAStream(signed char) 
2025-05-07T20:10:52.4082259Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:52.4082619Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:52.4083025Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:52.4084580Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:52.4084994Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:52.4085715Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.4086569Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:52.4087442Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.4088346Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:52.4089321Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.4090103Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:52.4090422Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:52.4090774Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:52.4110233Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:52.4110652Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:52.4111105Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:52.4111469Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:52.4112046Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:52.4112393Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:52.4112733Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:52.4113074Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:52.4113410Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:52.4113749Z U cudaEventRecord@libcudart.so.12 
2025-05-07T20:10:52.4114200Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:52.4114760Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:52.4115154Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:52.4115568Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:52.4115915Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:52.4116249Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:52.4116602Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:52.4116949Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:52.4117323Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4117758Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:52.4118184Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4118543Z U memcpy@GLIBC_2.14 2025-05-07T20:10:52.4118821Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:52.4119110Z U memset@GLIBC_2.2.5 2025-05-07T20:10:52.4119470Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:52.4119820Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.4120403Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:52.4121309Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.4122080Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:52.4122854Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:52.4123409Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:52.4123728Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:52.4124059Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.4124420Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.4124841Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:52.4125329Z U std::basic_ios 
>::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:52.4126178Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4126912Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:52.4127237Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:52.4127564Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:52.4127870Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:52.4128252Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4129160Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4129717Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:52.4130054Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:52.4130369Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:52.4130694Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:52.4131508Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:52.4132671Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.4133514Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.4134237Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.4134934Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4135406Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:52.4135829Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:52.4136259Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.4136854Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4137531Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:52.4137966Z w 
_ITM_deregisterTMCloneTable 2025-05-07T20:10:52.4138345Z w _ITM_registerTMCloneTable 2025-05-07T20:10:52.4138658Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:52.4138950Z w __gmon_start__ 2025-05-07T20:10:52.4139228Z w __pthread_key_create 2025-05-07T20:10:52.4139561Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:52.4140055Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.4140478Z 2025-05-07T20:10:52.4140653Z linux-vdso.so.1 (0x00007ffe0a3cd000) 2025-05-07T20:10:52.4140943Z libtorch.so => not found 2025-05-07T20:10:52.4141327Z libc10.so => not found 2025-05-07T20:10:52.4141717Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.4141964Z libc10_cuda.so => not found 2025-05-07T20:10:52.4142204Z libnccl.so.2 => not found 2025-05-07T20:10:52.4142444Z libcuda.so.1 => not found 2025-05-07T20:10:52.4142694Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.4142946Z libtorch_cpu.so => not found 2025-05-07T20:10:52.4143203Z libtorch_cuda.so => not found 2025-05-07T20:10:52.4143446Z libcudart.so.12 => not found 2025-05-07T20:10:52.4143764Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe7c4ccf000) 2025-05-07T20:10:52.4144189Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fe7c4c79000) 2025-05-07T20:10:52.4144573Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe7c4c4b000) 2025-05-07T20:10:52.4144923Z libc.so.6 => /lib64/libc.so.6 (0x00007fe7c4a43000) 2025-05-07T20:10:52.4145260Z /lib64/ld-linux-x86-64.so.2 (0x00007fe7c4fae000) 2025-05-07T20:10:52.4145605Z libm.so.6 => /lib64/libm.so.6 (0x00007fe7c4968000) 2025-05-07T20:10:52.4145817Z 2025-05-07T20:10:52.4145917Z [CHECK] Displaying ELF information: 2025-05-07T20:10:52.4146354Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:52.4146883Z 2025-05-07T20:10:52.4146887Z 2025-05-07T20:10:52.4147038Z Dynamic section at offset 0x71b10 contains 39 entries: 2025-05-07T20:10:52.4147419Z Tag Type 
Name/Value 2025-05-07T20:10:52.4147823Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:52.4148359Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:52.4148865Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:52.4149363Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:52.4149874Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:52.4150361Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:52.4150881Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:52.4151388Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:52.4151904Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:52.4152424Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:52.4152930Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:52.4153444Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:52.4153931Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:52.4154536Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:52.4155147Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:52.4155725Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:52.4156225Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:52.4156554Z 0x000000000000000d (FINI) 0x316ac 2025-05-07T20:10:52.4156892Z 0x0000000000000019 (INIT_ARRAY) 0x71130 2025-05-07T20:10:52.4157229Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:10:52.4157637Z 0x000000000000001a (FINI_ARRAY) 0x71158 2025-05-07T20:10:52.4157965Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:52.4158309Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:52.4158642Z 0x0000000000000005 (STRTAB) 
0x2ba8 2025-05-07T20:10:52.4158954Z 0x0000000000000006 (SYMTAB) 0xa18 2025-05-07T20:10:52.4159300Z 0x000000000000000a (STRSZ) 36158 (bytes) 2025-05-07T20:10:52.4159647Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:52.4159986Z 0x0000000000000003 (PLTGOT) 0x71fe8 2025-05-07T20:10:52.4160327Z 0x0000000000000002 (PLTRELSZ) 5520 (bytes) 2025-05-07T20:10:52.4160849Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:52.4161175Z 0x0000000000000017 (JMPREL) 0xdfa8 2025-05-07T20:10:52.4161486Z 0x0000000000000007 (RELA) 0xbcc8 2025-05-07T20:10:52.4161834Z 0x0000000000000008 (RELASZ) 8928 (bytes) 2025-05-07T20:10:52.4162177Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:52.4162530Z 0x000000006ffffffe (VERNEED) 0xbbb8 2025-05-07T20:10:52.4162854Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:52.4163211Z 0x000000006ffffff0 (VERSYM) 0xb8e6 2025-05-07T20:10:52.4163535Z 0x000000006ffffff9 (RELACOUNT) 162 2025-05-07T20:10:52.4164047Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:52.4164248Z 2025-05-07T20:10:52.4164373Z ################################################################################ 2025-05-07T20:10:52.4164593Z 2025-05-07T20:10:52.4164598Z 2025-05-07T20:10:52.4164710Z ################################################################################ 2025-05-07T20:10:52.4165230Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4165722Z [CHECK] Listing out library size: 2025-05-07T20:10:52.4166197Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4166583Z 2025-05-07T20:10:52.4166943Z 35 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4167280Z 2025-05-07T20:10:52.4167672Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4168651Z + objdump -TC 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.4168656Z 2025-05-07T20:10:52.4268701Z GLIBC_2.2.5 2025-05-07T20:10:52.4268967Z GLIBC_2.3 2025-05-07T20:10:52.4269205Z GLIBC_2.14 2025-05-07T20:10:52.4269222Z 2025-05-07T20:10:52.4269274Z 2025-05-07T20:10:52.4270559Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4272130Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.4272155Z 2025-05-07T20:10:52.4389003Z GLIBCXX_3.4 2025-05-07T20:10:52.4389284Z GLIBCXX_3.4.9 2025-05-07T20:10:52.4389560Z GLIBCXX_3.4.11 2025-05-07T20:10:52.4389814Z GLIBCXX_3.4.15 2025-05-07T20:10:52.4390117Z GLIBCXX_3.4.18 2025-05-07T20:10:52.4390210Z GLIBCXX_3.4.20 2025-05-07T20:10:52.4390291Z GLIBCXX_3.4.21 2025-05-07T20:10:52.4390297Z 2025-05-07T20:10:52.4390301Z 2025-05-07T20:10:52.4412975Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.KQTYiW0cNR.symbols.txt 2025-05-07T20:10:52.4413006Z 2025-05-07T20:10:52.4494767Z 2025-05-07T20:10:52.4518441Z [CHECK] Total Number of symbols: 1545 2025-05-07T20:10:52.4533485Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:10:52.4553197Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.t2CVLGc9gt.usymbols.txt 2025-05-07T20:10:52.4553239Z 2025-05-07T20:10:52.4575662Z 2025-05-07T20:10:52.4599089Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:10:52.4617124Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4617540Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4617689Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:10:52.4617925Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.4618095Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.4618239Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.4618585Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:52.4618746Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:52.4618869Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:52.4619015Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.4619145Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:52.4619269Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:52.4619440Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:52.4619548Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:52.4619741Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:52.4619845Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:52.4619956Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:52.4620073Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:52.4620182Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:52.4620286Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:52.4620399Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:52.4620512Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:52.4620624Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:52.4620727Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:52.4620855Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:52.4621056Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:52.4621230Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:52.4621370Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:52.4621499Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:52.4621649Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:52.4621855Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:52.4621965Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:52.4622085Z U 
at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:52.4622237Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:10:52.4622410Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:52.4622999Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4623655Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4623829Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.4623998Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.4624188Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:52.4624355Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.4624699Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.4624918Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:52.4625036Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:52.4625205Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.4625518Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:10:52.4625741Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:52.4625966Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:52.4626263Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:52.4626881Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:52.4627049Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 
2025-05-07T20:10:52.4627210Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.4627708Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4628246Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.4628763Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:52.4628906Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:52.4629125Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:52.4629276Z U at::globalContext() 2025-05-07T20:10:52.4629418Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:10:52.4629547Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:52.4629654Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:52.4629773Z U bool at::Tensor::item() const 2025-05-07T20:10:52.4629877Z U c10::AnyType::get() 2025-05-07T20:10:52.4630058Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:52.4630267Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4630369Z U c10::BoolType::get() 2025-05-07T20:10:52.4630538Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:52.4630722Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:52.4630843Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:52.4631376Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:52.4632007Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:52.4632379Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.4632495Z U c10::Error::what() const 2025-05-07T20:10:52.4632602Z U c10::GradMode::is_enabled() 2025-05-07T20:10:52.4632716Z U 
c10::GradMode::set_enabled(bool) 2025-05-07T20:10:52.4632951Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4633112Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:52.4633232Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:52.4633357Z U c10::IValue::isBoolList() const 2025-05-07T20:10:52.4633470Z U c10::IValue::isIntList() const 2025-05-07T20:10:52.4633584Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:52.4633698Z U c10::IValue::isTensorList() const 2025-05-07T20:10:52.4633855Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:52.4633995Z U c10::IntType::get() 2025-05-07T20:10:52.4634594Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.4634784Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:52.4634904Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:52.4635076Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.4635218Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.4635499Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:52.4635661Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:52.4635779Z U c10::StringType::get() 2025-05-07T20:10:52.4635928Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:52.4636330Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:52.4636478Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:52.4636602Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:52.4636713Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:52.4636871Z U c10::SymIntType::get() 2025-05-07T20:10:52.4637023Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:52.4637148Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:52.4637596Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 
2025-05-07T20:10:52.4637748Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:52.4637854Z U c10::TensorType::get() 2025-05-07T20:10:52.4638056Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:52.4638161Z U c10::Type::is_module() const 2025-05-07T20:10:52.4638289Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:52.4639015Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:52.4639152Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:52.4639269Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:52.4639406Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:52.4639518Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:52.4639635Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:52.4639761Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:52.4640008Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:52.4640112Z U c10::cuda::device_count() 2025-05-07T20:10:52.4640263Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:52.4640472Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:52.4640727Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:52.4640853Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:52.4641008Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:52.4641109Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:52.4641507Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.4642013Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.4642251Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:52.4642735Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, 
std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.4643050Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:52.4643591Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.4643859Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:52.4644112Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:52.4644294Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:52.4644409Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:52.4644510Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:52.4644808Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:52.4645008Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:52.4645140Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:52.4645289Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:52.4645408Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:52.4645517Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.4645660Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:52.4646012Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.4646143Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:52.4646267Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:52.4646430Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:52.4646538Z U c10::throwNullDataPtrError() 2025-05-07T20:10:52.4646646Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:52.4646751Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:52.4646856Z U 
c10::warnDeprecatedDataPtr() 2025-05-07T20:10:52.4647033Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:52.4647153Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:52.4647276Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:52.4647392Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:52.4647515Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:52.4647660Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:52.4647776Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:52.4647883Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:52.4648002Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:52.4648118Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:52.4648231Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:52.4648360Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:52.4648482Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:52.4648616Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:52.4648722Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:52.4648836Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:52.4648953Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:52.4649067Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:52.4649255Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:52.4649438Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4649524Z U free@GLIBC_2.2.5 2025-05-07T20:10:52.4649664Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4649753Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:52.4649857Z U long at::Tensor::item() const 2025-05-07T20:10:52.4650021Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:52.4650153Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.4650288Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.4650378Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:52.4650475Z U 
memcpy@GLIBC_2.14 2025-05-07T20:10:52.4650561Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:52.4650649Z U memset@GLIBC_2.2.5 2025-05-07T20:10:52.4650797Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:52.4650910Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.4650999Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:52.4651196Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:52.4651520Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:52.4651877Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.4652183Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:52.4652525Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:52.4652634Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:52.4652753Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:52.4652883Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.4653013Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.4653179Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:52.4653301Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:52.4653433Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:52.4653663Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:52.4654191Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4654337Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:52.4654455Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:52.4654567Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:52.4654673Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:52.4654788Z U 
std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:52.4654957Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4655200Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.4655326Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:52.4655479Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:52.4655605Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:52.4655797Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:52.4656206Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:52.4656335Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:52.4656447Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:52.4656537Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:52.4656626Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:52.4656743Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:52.4657297Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:52.4657727Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.4658005Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.4658118Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:52.4658577Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:52.4658766Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:52.4658966Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:52.4659148Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:52.4659497Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:10:52.4659648Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:52.4659835Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:52.4660019Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:52.4660141Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:52.4660427Z U torch::autograd::Node::metadata() 2025-05-07T20:10:52.4660562Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:52.4660819Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:52.4661102Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:52.4661255Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:52.4661466Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:52.4661713Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:52.4667321Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:52.4667516Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:52.4667670Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:52.4667841Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:52.4668000Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:52.4668417Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:52.4668794Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.4669345Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, 
std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:52.4669486Z U typeinfo for c10::Error 2025-05-07T20:10:52.4669602Z U typeinfo for c10::Type 2025-05-07T20:10:52.4669796Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.4669929Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:52.4670075Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:52.4670197Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:52.4670355Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:52.4670536Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:52.4670697Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:52.4670858Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.4671037Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.4671146Z U vtable for c10::Error 2025-05-07T20:10:52.4671260Z U vtable for c10::ListType 2025-05-07T20:10:52.4671613Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.4671751Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.4671983Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:52.4672135Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:52.4672248Z U vtable for torch::autograd::Node 2025-05-07T20:10:52.4672431Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.4672545Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:52.4672667Z w _ITM_registerTMCloneTable 2025-05-07T20:10:52.4672774Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:52.4672869Z w __gmon_start__ 2025-05-07T20:10:52.4673019Z w __pthread_key_create 2025-05-07T20:10:52.4673131Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:52.4673249Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:52.4673411Z [CHECK] Listing out external shared libraries linked: 
2025-05-07T20:10:52.4673635Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4673642Z 2025-05-07T20:10:52.4673788Z linux-vdso.so.1 (0x00007fffdf7f4000) 2025-05-07T20:10:52.4673896Z libc10.so => not found 2025-05-07T20:10:52.4673997Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.4674212Z libc10_cuda.so => not found 2025-05-07T20:10:52.4674354Z libnccl.so.2 => not found 2025-05-07T20:10:52.4674473Z libcuda.so.1 => not found 2025-05-07T20:10:52.4675032Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f8d3bbb8000) 2025-05-07T20:10:52.4675576Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f8d38600000) 2025-05-07T20:10:52.4675704Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.4675800Z libtorch.so => not found 2025-05-07T20:10:52.4675899Z libtorch_cpu.so => not found 2025-05-07T20:10:52.4676018Z libtorch_cuda.so => not found 2025-05-07T20:10:52.4676116Z libcudart.so.12 => not found 2025-05-07T20:10:52.4676278Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8d3839c000) 2025-05-07T20:10:52.4676405Z libm.so.6 => /lib64/libm.so.6 (0x00007f8d39725000) 2025-05-07T20:10:52.4676569Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8d3bb88000) 2025-05-07T20:10:52.4676698Z libc.so.6 => /lib64/libc.so.6 (0x00007f8d38194000) 2025-05-07T20:10:52.4676830Z /lib64/ld-linux-x86-64.so.2 (0x00007f8d3bd65000) 2025-05-07T20:10:52.4676931Z libc10.so => not found 2025-05-07T20:10:52.4677031Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.4677128Z libc10_cuda.so => not found 2025-05-07T20:10:52.4677237Z libnccl.so.2 => not found 2025-05-07T20:10:52.4677330Z libcuda.so.1 => not found 2025-05-07T20:10:52.4677462Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.4677558Z libtorch.so => not found 2025-05-07T20:10:52.4677692Z libtorch_cpu.so => not found 
2025-05-07T20:10:52.4677798Z libtorch_cuda.so => not found 2025-05-07T20:10:52.4677899Z libcudart.so.12 => not found 2025-05-07T20:10:52.4678017Z libtorch.so => not found 2025-05-07T20:10:52.4678108Z libc10.so => not found 2025-05-07T20:10:52.4678204Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.4678301Z libc10_cuda.so => not found 2025-05-07T20:10:52.4678414Z libnccl.so.2 => not found 2025-05-07T20:10:52.4678516Z libcuda.so.1 => not found 2025-05-07T20:10:52.4678617Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.4678737Z libtorch_cpu.so => not found 2025-05-07T20:10:52.4678841Z libtorch_cuda.so => not found 2025-05-07T20:10:52.4678942Z libcudart.so.12 => not found 2025-05-07T20:10:52.4679103Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f8d3813e000) 2025-05-07T20:10:52.4679109Z 2025-05-07T20:10:52.4679238Z [CHECK] Displaying ELF information: 2025-05-07T20:10:52.4679492Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:52.4679497Z 2025-05-07T20:10:52.4697969Z 2025-05-07T20:10:52.4698833Z Dynamic section at offset 0x220d958 contains 42 entries: 2025-05-07T20:10:52.4699214Z Tag Type Name/Value 2025-05-07T20:10:52.4699796Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:52.4700403Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:52.4701030Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:52.4701665Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:52.4702237Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:52.4704785Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:52.4704995Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:52.4705200Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:52.4705400Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 
2025-05-07T20:10:52.4705596Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:52.4705789Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:52.4706054Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:52.4706247Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:52.4706426Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:52.4706625Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:52.4706872Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:52.4707082Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:52.4707329Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:52.4707502Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:52.4707612Z 0x000000000000000c (INIT) 0x56000 2025-05-07T20:10:52.4707721Z 0x000000000000000d (FINI) 0x1515ac 2025-05-07T20:10:52.4707855Z 0x0000000000000019 (INIT_ARRAY) 0x220b430 2025-05-07T20:10:52.4707980Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:10:52.4708097Z 0x000000000000001a (FINI_ARRAY) 0x220b4c0 2025-05-07T20:10:52.4708232Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:52.4708340Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:52.4708452Z 0x0000000000000005 (STRTAB) 0xbb50 2025-05-07T20:10:52.4708559Z 0x0000000000000006 (SYMTAB) 0x2a60 2025-05-07T20:10:52.4708741Z 0x000000000000000a (STRSZ) 242227 (bytes) 2025-05-07T20:10:52.4709024Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:52.4709146Z 0x0000000000000003 (PLTGOT) 0x220efe8 2025-05-07T20:10:52.4709297Z 0x0000000000000002 (PLTRELSZ) 16872 (bytes) 2025-05-07T20:10:52.4709409Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:52.4709700Z 0x0000000000000017 (JMPREL) 0x512d8 2025-05-07T20:10:52.4709830Z 0x0000000000000007 (RELA) 0x47af8 
2025-05-07T20:10:52.4710002Z 0x0000000000000008 (RELASZ) 38880 (bytes) 2025-05-07T20:10:52.4710130Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:52.4710252Z 0x000000006ffffffe (VERNEED) 0x47998 2025-05-07T20:10:52.4710388Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:52.4710515Z 0x000000006ffffff0 (VERSYM) 0x46d84 2025-05-07T20:10:52.4710631Z 0x000000006ffffff9 (RELACOUNT) 571 2025-05-07T20:10:52.4710759Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:52.4710765Z 2025-05-07T20:10:52.4710887Z ################################################################################ 2025-05-07T20:10:52.4710893Z 2025-05-07T20:10:52.4710897Z 2025-05-07T20:10:52.4711013Z ################################################################################ 2025-05-07T20:10:52.4711267Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.4711377Z [CHECK] Listing out library size: 2025-05-07T20:10:52.4711617Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.4711623Z 2025-05-07T20:10:52.4711826Z 73 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.4711830Z 2025-05-07T20:10:52.4712191Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.4712676Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.4712683Z 2025-05-07T20:10:52.5112396Z GLIBC_2.2.5 2025-05-07T20:10:52.5112495Z GLIBC_2.3 2025-05-07T20:10:52.5112887Z GLIBC_2.14 2025-05-07T20:10:52.5113055Z 2025-05-07T20:10:52.5113065Z 2025-05-07T20:10:52.5113697Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.5114700Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 
's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.5114709Z 2025-05-07T20:10:52.5510826Z GLIBCXX_3.4 2025-05-07T20:10:52.5511096Z GLIBCXX_3.4.9 2025-05-07T20:10:52.5511347Z GLIBCXX_3.4.11 2025-05-07T20:10:52.5511591Z GLIBCXX_3.4.14 2025-05-07T20:10:52.5511895Z GLIBCXX_3.4.15 2025-05-07T20:10:52.5512207Z GLIBCXX_3.4.18 2025-05-07T20:10:52.5512307Z GLIBCXX_3.4.19 2025-05-07T20:10:52.5512418Z GLIBCXX_3.4.20 2025-05-07T20:10:52.5512519Z GLIBCXX_3.4.21 2025-05-07T20:10:52.5512544Z 2025-05-07T20:10:52.5512549Z 2025-05-07T20:10:52.5531651Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.qkN2miV3jz.symbols.txt 2025-05-07T20:10:52.5531674Z 2025-05-07T20:10:52.5860672Z 2025-05-07T20:10:52.5888233Z [CHECK] Total Number of symbols: 6648 2025-05-07T20:10:52.5911691Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:10:52.5928899Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.tLTyt29Vsy.usymbols.txt 2025-05-07T20:10:52.5928932Z 2025-05-07T20:10:52.5964789Z 2025-05-07T20:10:52.5991692Z [CHECK] Listing out undefined symbols (465 total): 2025-05-07T20:10:52.6007142Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.6008367Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.6008997Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:52.6009296Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:10:52.6009721Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.6010167Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:52.6010535Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.6010927Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:52.6011320Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:52.6011682Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:52.6012081Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:52.6012436Z U 
__cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:52.6012776Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:52.6012889Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:52.6012992Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:52.6013123Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:52.6013237Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:52.6013347Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:52.6013468Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:52.6013576Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:52.6013679Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:52.6013802Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:52.6013903Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:52.6014019Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:52.6014121Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:52.6014232Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:52.6014495Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:52.6014626Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:52.6014774Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:52.6014933Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:52.6015052Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:10:52.6015189Z U at::SplitUntil32Bit::end() const 2025-05-07T20:10:52.6015338Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:10:52.6015474Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:10:52.6015773Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:52.6015972Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:52.6016213Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:10:52.6016392Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:10:52.6016530Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:10:52.6016669Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:10:52.6016812Z U 
at::TensorIteratorBase::numel() const 2025-05-07T20:10:52.6016964Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:10:52.6017184Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:10:52.6017406Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:10:52.6017538Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:52.6017669Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:10:52.6017813Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:10:52.6018063Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6018307Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6018422Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:52.6018995Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:10:52.6019186Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.6019332Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:10:52.6019534Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:10:52.6019690Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6019889Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:52.6020056Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:52.6020195Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:52.6020408Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:10:52.6020724Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:10:52.6020884Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6021426Z U 
at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6022022Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6022207Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:52.6022386Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:52.6022499Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:10:52.6022989Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6023182Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6023461Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:10:52.6023682Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:52.6023811Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:10:52.6023965Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.6024072Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:52.6024234Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.6024760Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6024933Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.6025409Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6025586Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.6025954Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:52.6026115Z U at::_ops::select_int::call(at::Tensor const&, 
long, c10::SymInt) 2025-05-07T20:10:52.6026516Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.6026854Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:52.6026994Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:10:52.6027204Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:52.6027347Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:10:52.6027564Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:52.6027742Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:10:52.6027988Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:52.6028274Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:52.6029334Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:52.6029503Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:10:52.6029808Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:10:52.6029959Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:52.6030133Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:52.6030292Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:52.6030417Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:52.6030886Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6031493Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 
2025-05-07T20:10:52.6031819Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:10:52.6031946Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:10:52.6032078Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:52.6032221Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:52.6032361Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:10:52.6032696Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:10:52.6032832Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:52.6032994Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:52.6033094Z U at::get_num_threads() 2025-05-07T20:10:52.6033201Z U at::get_thread_num() 2025-05-07T20:10:52.6033416Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:10:52.6033535Z U at::internal::set_thread_num(int) 2025-05-07T20:10:52.6033773Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:10:52.6034480Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6035190Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:52.6035469Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:52.6035613Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:10:52.6035738Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:52.6035912Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:52.6036009Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:52.6036129Z U bool at::Tensor::item() const 2025-05-07T20:10:52.6036271Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6036420Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6036518Z U c10::AnyType::get() 
2025-05-07T20:10:52.6036696Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:52.6036869Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6037073Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6037181Z U c10::BoolType::get() 2025-05-07T20:10:52.6037283Z U c10::DeviceObjType::get() 2025-05-07T20:10:52.6037441Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:52.6037655Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:52.6037783Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:52.6038300Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:52.6038937Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:52.6039335Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.6039434Z U c10::Error::what() const 2025-05-07T20:10:52.6039546Z U c10::FloatType::get() 2025-05-07T20:10:52.6039682Z U c10::GradMode::is_enabled() 2025-05-07T20:10:52.6039797Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:52.6039957Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6040128Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6040281Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:52.6040399Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:52.6040635Z U c10::IValue::isBoolList() const 2025-05-07T20:10:52.6040737Z U c10::IValue::isIntList() const 2025-05-07T20:10:52.6040844Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:52.6040956Z U c10::IValue::isTensorList() const 2025-05-07T20:10:52.6041089Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:52.6041286Z U c10::InferenceMode::is_enabled() 2025-05-07T20:10:52.6041394Z U 
c10::IntType::get() 2025-05-07T20:10:52.6041837Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.6042024Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:52.6042149Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:52.6042267Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.6042383Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:52.6042611Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.6042727Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:52.6042837Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:52.6042954Z U c10::ScalarTypeType::get() 2025-05-07T20:10:52.6043211Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:52.6043505Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:10:52.6043666Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:52.6043757Z U c10::StringType::get() 2025-05-07T20:10:52.6043887Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:52.6044031Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:52.6044165Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:52.6044533Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:52.6044667Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:52.6044821Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:10:52.6044941Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:52.6045075Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:52.6045182Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:52.6045297Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:52.6045423Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:52.6045523Z U 
c10::SymInt::toSymNode() const 2025-05-07T20:10:52.6045615Z U c10::SymIntType::get() 2025-05-07T20:10:52.6045777Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:52.6045893Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:52.6046302Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:52.6046471Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:52.6046570Z U c10::TensorType::get() 2025-05-07T20:10:52.6047300Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:52.6047485Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:52.6047584Z U c10::Type::is_module() const 2025-05-07T20:10:52.6047701Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:52.6048377Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:52.6048503Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:52.6048657Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:52.6048944Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:52.6049252Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:52.6049362Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:52.6049490Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:52.6049595Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:52.6049702Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:52.6049805Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:52.6050043Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:52.6050138Z U c10::cuda::current_device() 2025-05-07T20:10:52.6050232Z U c10::cuda::device_count() 2025-05-07T20:10:52.6050377Z U c10::cuda::getCurrentCUDAStream(signed 
char) 2025-05-07T20:10:52.6050499Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:52.6050630Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:52.6050771Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:52.6050916Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:52.6051019Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:52.6051433Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:52.6051907Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:52.6052166Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:52.6052634Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.6052946Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:52.6054775Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:52.6055038Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:52.6055287Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:52.6055543Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:52.6055650Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:52.6055747Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:52.6056059Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:52.6056226Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:52.6056341Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:10:52.6056470Z U c10::impl::PyObjectSlot::~PyObjectSlot() 
2025-05-07T20:10:52.6056606Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:52.6056755Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:52.6056878Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:52.6056986Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.6057130Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:52.6057511Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:52.6057622Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:10:52.6057728Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:52.6057854Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:10:52.6057969Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:52.6058095Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:10:52.6058201Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:10:52.6058311Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:52.6058441Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:52.6058568Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:52.6058722Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:52.6058848Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:52.6058958Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:10:52.6059070Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:52.6059177Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:52.6059282Z U c10::report_overflow(char const*) 2025-05-07T20:10:52.6059397Z U c10::throwNullDataPtrError() 2025-05-07T20:10:52.6059506Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:52.6059600Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:52.6059711Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:52.6059921Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 
2025-05-07T20:10:52.6060029Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:52.6060116Z U ceil@GLIBC_2.2.5 2025-05-07T20:10:52.6060228Z U cublasGemmStridedBatchedEx 2025-05-07T20:10:52.6060323Z U cublasSetStream_v2 2025-05-07T20:10:52.6060442Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:52.6060569Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:10:52.6060683Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:52.6060833Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:52.6060947Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:52.6061059Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:52.6061161Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:52.6061295Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:52.6061418Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:52.6061517Z U cudaFree@libcudart.so.12 2025-05-07T20:10:52.6061632Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:52.6061751Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:52.6061850Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:52.6062143Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:52.6062288Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:52.6062399Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:52.6062512Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:52.6062643Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:10:52.6062757Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:10:52.6062874Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:10:52.6062992Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:52.6063111Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:10:52.6063248Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:10:52.6063369Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:10:52.6063482Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:10:52.6063599Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:52.6063708Z U 
cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:52.6064000Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:52.6064131Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:52.6064235Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:52.6064342Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:52.6064499Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:52.6064793Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:52.6064934Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6065110Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6065203Z U exit@GLIBC_2.2.5 2025-05-07T20:10:52.6065288Z U exp10@GLIBC_2.2.5 2025-05-07T20:10:52.6065374Z U exp2@GLIBC_2.2.5 2025-05-07T20:10:52.6065474Z U exp@GLIBC_2.2.5 2025-05-07T20:10:52.6065556Z U expf@GLIBC_2.2.5 2025-05-07T20:10:52.6065746Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:52.6065947Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:52.6066138Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:52.6066330Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:52.6066565Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:52.6066706Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6066854Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6066954Z U fmod@GLIBC_2.2.5 2025-05-07T20:10:52.6067041Z U free@GLIBC_2.2.5 2025-05-07T20:10:52.6067155Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:10:52.6067260Z U int at::Tensor::item() const 2025-05-07T20:10:52.6067458Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:52.6067583Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6067720Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6067821Z U isnan@GLIBC_2.2.5 
2025-05-07T20:10:52.6067949Z U lgamma@GLIBC_2.2.5 2025-05-07T20:10:52.6068045Z U llrint@GLIBC_2.2.5 2025-05-07T20:10:52.6068145Z U llround@GLIBC_2.2.5 2025-05-07T20:10:52.6068233Z U log10@GLIBC_2.2.5 2025-05-07T20:10:52.6068321Z U log2@GLIBC_2.2.5 2025-05-07T20:10:52.6068525Z U log@GLIBC_2.2.5 2025-05-07T20:10:52.6068617Z U logl@GLIBC_2.2.5 2025-05-07T20:10:52.6068729Z U long at::Tensor::item() const 2025-05-07T20:10:52.6068896Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:52.6069066Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:52.6069194Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6069337Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6069608Z U lrint@GLIBC_2.2.5 2025-05-07T20:10:52.6069840Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:52.6069929Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:52.6070232Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:52.6070319Z U memcpy@GLIBC_2.14 2025-05-07T20:10:52.6070409Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:52.6070499Z U memset@GLIBC_2.2.5 2025-05-07T20:10:52.6070603Z U nextafter@GLIBC_2.2.5 2025-05-07T20:10:52.6070700Z U nvmlDeviceGetCount_v2 2025-05-07T20:10:52.6070815Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:10:52.6070953Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:10:52.6071063Z U nvmlDeviceGetNvLinkState 2025-05-07T20:10:52.6071167Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:10:52.6071259Z U nvmlInit_v2 2025-05-07T20:10:52.6071381Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:52.6071500Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.6071626Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:52.6071729Z U pow@GLIBC_2.2.5 2025-05-07T20:10:52.6071818Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:52.6071977Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6072185Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6072276Z U sin@GLIBC_2.2.5 
2025-05-07T20:10:52.6072487Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:52.6072672Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:52.6072860Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:10:52.6073028Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:52.6073447Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:52.6073786Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:52.6074270Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.6074614Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:52.6075028Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:52.6075401Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:52.6075560Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:52.6075679Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:52.6075796Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:52.6075918Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:52.6076030Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:52.6076169Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:52.6076314Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.6076452Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.6076591Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:52.6076767Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:52.6076894Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 
2025-05-07T20:10:52.6077033Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:52.6077248Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:10:52.6077610Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:52.6077851Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:10:52.6078099Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:52.6078435Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:52.6078667Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:10:52.6079250Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.6079402Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:10:52.6079500Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:52.6079661Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:52.6079787Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:52.6079906Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:52.6080032Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:52.6080147Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:52.6080259Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:52.6080460Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:52.6080639Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.6080909Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:52.6081033Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:52.6081155Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:52.6081266Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:52.6081407Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:10:52.6081577Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 
2025-05-07T20:10:52.6081710Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:52.6081939Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:52.6082373Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:52.6082536Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:52.6082646Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:52.6082759Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:52.6082853Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:52.6082947Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:52.6083084Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:52.6083669Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:52.6084129Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.6084638Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:52.6084898Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:52.6085058Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:52.6085351Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:52.6085533Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:52.6085743Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:52.6085927Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:52.6086278Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:52.6086437Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:52.6086622Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:52.6086801Z U 
torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:52.6087053Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:52.6087155Z U torch::autograd::Node::metadata() 2025-05-07T20:10:52.6087277Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:52.6087505Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:52.6087765Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:52.6087894Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:52.6088084Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:52.6088299Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:52.6090836Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:52.6090977Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:52.6091141Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:52.6091301Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:52.6091440Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:52.6091813Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:52.6092157Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.6092511Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:52.6092695Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:10:52.6092819Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:10:52.6093325Z U 
transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:52.6093465Z U typeinfo for c10::Error 2025-05-07T20:10:52.6093560Z U typeinfo for c10::Type 2025-05-07T20:10:52.6093682Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.6093795Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:52.6093934Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:52.6094051Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:52.6094160Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:52.6094350Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:52.6094542Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:52.6094962Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:52.6095460Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:52.6095875Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:52.6096377Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:52.6096788Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:10:52.6097264Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:10:52.6097748Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:52.6098259Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 
2025-05-07T20:10:52.6098739Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:52.6099331Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:52.6099875Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:52.6100022Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:52.6100202Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:52.6100348Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:52.6100497Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.6100648Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:52.6100755Z U vtable for at::TensorIterator 2025-05-07T20:10:52.6100863Z U vtable for at::TensorIteratorBase 2025-05-07T20:10:52.6100966Z U vtable for c10::Error 2025-05-07T20:10:52.6101058Z U vtable for c10::ListType 2025-05-07T20:10:52.6101363Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:52.6101498Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:52.6101711Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:52.6101827Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:52.6101969Z U vtable for torch::autograd::Node 2025-05-07T20:10:52.6102136Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:52.6102234Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:52.6102346Z w _ITM_registerTMCloneTable 2025-05-07T20:10:52.6102443Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:52.6102525Z w __gmon_start__ 2025-05-07T20:10:52.6102627Z w __pthread_key_create 2025-05-07T20:10:52.6102729Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:52.6102830Z w 
pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:52.6102917Z w pthread_once 2025-05-07T20:10:52.6103066Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:52.6103226Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.6103235Z 2025-05-07T20:10:52.6103396Z linux-vdso.so.1 (0x00007ffc7b14a000) 2025-05-07T20:10:52.6103479Z libc10.so => not found 2025-05-07T20:10:52.6103571Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6103660Z libc10_cuda.so => not found 2025-05-07T20:10:52.6103759Z libnccl.so.2 => not found 2025-05-07T20:10:52.6103844Z libcuda.so.1 => not found 2025-05-07T20:10:52.6104184Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f1267600000) 2025-05-07T20:10:52.6104720Z fbgemm_gpu_embedding_inplace_ops.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so (0x00007f126c9ff000) 2025-05-07T20:10:52.6105187Z fbgemm_gpu_tbe_index_select.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so (0x00007f1265200000) 2025-05-07T20:10:52.6105636Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f1263a00000) 2025-05-07T20:10:52.6106112Z fbgemm_gpu_tbe_optimizers.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so (0x00007f1263000000) 2025-05-07T20:10:52.6106202Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6106286Z libtorch.so => not found 2025-05-07T20:10:52.6106795Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f1262e59000) 2025-05-07T20:10:52.6107236Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f1261c00000) 
2025-05-07T20:10:52.6107323Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6107423Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6107509Z libcudart.so.12 => not found 2025-05-07T20:10:52.6107683Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f126199c000) 2025-05-07T20:10:52.6107796Z libm.so.6 => /lib64/libm.so.6 (0x00007f1265125000) 2025-05-07T20:10:52.6107939Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f126c9cd000) 2025-05-07T20:10:52.6108047Z libc.so.6 => /lib64/libc.so.6 (0x00007f1261794000) 2025-05-07T20:10:52.6108169Z /lib64/ld-linux-x86-64.so.2 (0x00007f126ca78000) 2025-05-07T20:10:52.6108256Z libc10.so => not found 2025-05-07T20:10:52.6108337Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6108420Z libc10_cuda.so => not found 2025-05-07T20:10:52.6108513Z libnccl.so.2 => not found 2025-05-07T20:10:52.6108593Z libcuda.so.1 => not found 2025-05-07T20:10:52.6108926Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f1267b89000) 2025-05-07T20:10:52.6109017Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6109114Z libtorch.so => not found 2025-05-07T20:10:52.6109198Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6109291Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6109385Z libtorch.so => not found 2025-05-07T20:10:52.6109463Z libc10.so => not found 2025-05-07T20:10:52.6109576Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6109661Z libc10_cuda.so => not found 2025-05-07T20:10:52.6109759Z libnccl.so.2 => not found 2025-05-07T20:10:52.6109876Z libcuda.so.1 => not found 2025-05-07T20:10:52.6109969Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6110066Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6110152Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6110238Z libcudart.so.12 => not found 2025-05-07T20:10:52.6110557Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f126c96f000) 2025-05-07T20:10:52.6110654Z libc10.so => not found 2025-05-07T20:10:52.6110742Z libnvrtc.so.12 => not 
found 2025-05-07T20:10:52.6110860Z libc10_cuda.so => not found 2025-05-07T20:10:52.6110958Z libnccl.so.2 => not found 2025-05-07T20:10:52.6111044Z libcuda.so.1 => not found 2025-05-07T20:10:52.6111132Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6111219Z libtorch.so => not found 2025-05-07T20:10:52.6111321Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6111417Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6111506Z libcudart.so.12 => not found 2025-05-07T20:10:52.6111604Z libtorch.so => not found 2025-05-07T20:10:52.6111688Z libc10.so => not found 2025-05-07T20:10:52.6111774Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6111860Z libc10_cuda.so => not found 2025-05-07T20:10:52.6111960Z libnccl.so.2 => not found 2025-05-07T20:10:52.6112051Z libcuda.so.1 => not found 2025-05-07T20:10:52.6112145Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6112248Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6112343Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6112433Z libcudart.so.12 => not found 2025-05-07T20:10:52.6112519Z libtorch.so => not found 2025-05-07T20:10:52.6112800Z libc10.so => not found 2025-05-07T20:10:52.6112896Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6112995Z libc10_cuda.so => not found 2025-05-07T20:10:52.6113149Z libnccl.so.2 => not found 2025-05-07T20:10:52.6113243Z libcuda.so.1 => not found 2025-05-07T20:10:52.6113343Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6113433Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6113540Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6113632Z libcudart.so.12 => not found 2025-05-07T20:10:52.6113726Z libc10.so => not found 2025-05-07T20:10:52.6113837Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6113928Z libc10_cuda.so => not found 2025-05-07T20:10:52.6114199Z libnccl.so.2 => not found 2025-05-07T20:10:52.6114297Z libcuda.so.1 => not found 2025-05-07T20:10:52.6114407Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6114534Z libtorch.so => not found 
2025-05-07T20:10:52.6114633Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6114744Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6114840Z libcudart.so.12 => not found 2025-05-07T20:10:52.6114932Z libtorch.so => not found 2025-05-07T20:10:52.6115026Z libc10.so => not found 2025-05-07T20:10:52.6115183Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6115275Z libc10_cuda.so => not found 2025-05-07T20:10:52.6115369Z libnccl.so.2 => not found 2025-05-07T20:10:52.6115472Z libcuda.so.1 => not found 2025-05-07T20:10:52.6115572Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6115670Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6115770Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6115881Z libcudart.so.12 => not found 2025-05-07T20:10:52.6115977Z libtorch.so => not found 2025-05-07T20:10:52.6116072Z libc10.so => not found 2025-05-07T20:10:52.6116180Z libnvrtc.so.12 => not found 2025-05-07T20:10:52.6116276Z libc10_cuda.so => not found 2025-05-07T20:10:52.6116373Z libnccl.so.2 => not found 2025-05-07T20:10:52.6116466Z libcuda.so.1 => not found 2025-05-07T20:10:52.6116579Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:52.6116675Z libtorch_cpu.so => not found 2025-05-07T20:10:52.6116767Z libtorch_cuda.so => not found 2025-05-07T20:10:52.6116916Z librt.so.1 => /lib64/librt.so.1 (0x00007f1267b84000) 2025-05-07T20:10:52.6117092Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f1267b7f000) 2025-05-07T20:10:52.6117127Z 2025-05-07T20:10:52.6117232Z [CHECK] Displaying ELF information: 2025-05-07T20:10:52.6117440Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:52.6117445Z 2025-05-07T20:10:52.6117488Z 2025-05-07T20:10:52.6117649Z Dynamic section at offset 0x48e4fa8 contains 47 entries: 2025-05-07T20:10:52.6117760Z Tag Type Name/Value 2025-05-07T20:10:52.6117967Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:52.6118169Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 
2025-05-07T20:10:52.6118361Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:52.6118573Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:52.6118774Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:52.6118962Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:52.6119222Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:52.6119469Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:52.6119683Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:52.6119914Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:52.6120129Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:52.6120325Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:52.6120567Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:52.6120792Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:52.6121022Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:52.6121224Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:52.6121440Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:52.6121633Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:52.6121815Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:52.6122023Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:52.6122242Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:52.6122451Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:52.6122660Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:10:52.6122882Z 
0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:52.6122997Z 0x000000000000000c (INIT) 0x1bb000 2025-05-07T20:10:52.6123107Z 0x000000000000000d (FINI) 0x75816c 2025-05-07T20:10:52.6123242Z 0x0000000000000019 (INIT_ARRAY) 0x48d6858 2025-05-07T20:10:52.6123369Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:10:52.6123487Z 0x000000000000001a (FINI_ARRAY) 0x48d6ce0 2025-05-07T20:10:52.6123613Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:52.6123722Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:52.6123833Z 0x0000000000000005 (STRTAB) 0x33248 2025-05-07T20:10:52.6123942Z 0x0000000000000006 (SYMTAB) 0xc2f0 2025-05-07T20:10:52.6124092Z 0x000000000000000a (STRSZ) 1276767 (bytes) 2025-05-07T20:10:52.6124207Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:52.6124324Z 0x0000000000000003 (PLTGOT) 0x48eafe8 2025-05-07T20:10:52.6124471Z 0x0000000000000002 (PLTRELSZ) 68808 (bytes) 2025-05-07T20:10:52.6124580Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:52.6124720Z 0x0000000000000017 (JMPREL) 0x1a9648 2025-05-07T20:10:52.6124842Z 0x0000000000000007 (RELA) 0x16e320 2025-05-07T20:10:52.6124973Z 0x0000000000000008 (RELASZ) 242472 (bytes) 2025-05-07T20:10:52.6125086Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:52.6125199Z 0x000000006ffffffe (VERNEED) 0x16e1a0 2025-05-07T20:10:52.6125317Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:52.6125430Z 0x000000006ffffff0 (VERSYM) 0x16ada8 2025-05-07T20:10:52.6125539Z 0x000000006ffffff9 (RELACOUNT) 2870 2025-05-07T20:10:52.6125652Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:52.6125656Z 2025-05-07T20:10:52.6125772Z ################################################################################ 2025-05-07T20:10:52.6125778Z 2025-05-07T20:10:52.6125782Z 2025-05-07T20:10:52.6125893Z ################################################################################ 2025-05-07T20:10:52.6126211Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:52.6126315Z [CHECK] Listing out library size: 2025-05-07T20:10:52.6126610Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:52.6126614Z 2025-05-07T20:10:52.6129500Z 904 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:52.6129525Z 2025-05-07T20:10:52.6131714Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:52.6133314Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.6133335Z 2025-05-07T20:10:52.8094097Z GLIBC_2.2.5 2025-05-07T20:10:52.8094490Z GLIBC_2.3 2025-05-07T20:10:52.8094712Z GLIBC_2.14 2025-05-07T20:10:52.8094840Z 2025-05-07T20:10:52.8094849Z 2025-05-07T20:10:52.8095305Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:52.8096409Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:52.8097056Z 2025-05-07T20:10:53.0085300Z GLIBCXX_3.4 2025-05-07T20:10:53.0085552Z GLIBCXX_3.4.9 2025-05-07T20:10:53.0085759Z GLIBCXX_3.4.11 2025-05-07T20:10:53.0086114Z GLIBCXX_3.4.14 2025-05-07T20:10:53.0086316Z GLIBCXX_3.4.15 2025-05-07T20:10:53.0086515Z GLIBCXX_3.4.18 2025-05-07T20:10:53.0086707Z GLIBCXX_3.4.20 2025-05-07T20:10:53.0086903Z GLIBCXX_3.4.21 2025-05-07T20:10:53.0087031Z 2025-05-07T20:10:53.0087036Z 2025-05-07T20:10:53.0111508Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.oMyo6RRnVn.symbols.txt 2025-05-07T20:10:53.0113059Z 
2025-05-07T20:10:53.2155601Z 2025-05-07T20:10:53.2236603Z [CHECK] Total Number of symbols: 12682 2025-05-07T20:10:53.2327355Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:10:53.2346833Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.n8JTQvlQ34.usymbols.txt 2025-05-07T20:10:53.2348413Z 2025-05-07T20:10:53.2413453Z 2025-05-07T20:10:53.2439278Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:10:53.2456742Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.2457650Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.2458266Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:53.2458639Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.2459064Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.2459647Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.2460045Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:53.2460441Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:53.2460793Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:53.2461169Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.2461524Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:53.2461866Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:53.2462181Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:53.2462583Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:53.2462916Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:53.2463236Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:53.2463573Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:53.2463893Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:53.2464216Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:53.2464632Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:53.2464959Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:53.2465258Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:53.2465575Z U 
__tls_get_addr@GLIBC_2.3 2025-05-07T20:10:53.2465908Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:53.2466300Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:53.2466723Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:53.2467125Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:53.2467542Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:53.2468082Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:53.2468449Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:53.2468876Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:53.2469446Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:53.2470006Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:53.2470860Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.2472092Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.2473048Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:53.2474031Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.2475473Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:53.2476044Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:53.2476476Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:53.2477222Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.2478407Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, 
std::optional, std::optional, std::optional) 2025-05-07T20:10:53.2479315Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:53.2479719Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:53.2480105Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:53.2480503Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:53.2480884Z U at::get_thread_num() 2025-05-07T20:10:53.2481187Z U at::globalContext() 2025-05-07T20:10:53.2481522Z U at::internal::set_thread_num(int) 2025-05-07T20:10:53.2481877Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:53.2482282Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:53.2482725Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:53.2483061Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:53.2483359Z U c10::AnyType::get() 2025-05-07T20:10:53.2483763Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.2484197Z U c10::BoolType::get() 2025-05-07T20:10:53.2484564Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:53.2485012Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:53.2485435Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:53.2486182Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:53.2487457Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:53.2488654Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:53.2489239Z U c10::Error::what() const 2025-05-07T20:10:53.2489564Z U c10::FloatType::get() 2025-05-07T20:10:53.2489904Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:53.2490238Z U c10::GradMode::is_enabled() 2025-05-07T20:10:53.2490578Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:53.2490981Z U c10::Half* 
at::TensorBase::data_ptr() const 2025-05-07T20:10:53.2491440Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.2491886Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:53.2492425Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:53.2492913Z U c10::IValue::isBoolList() const 2025-05-07T20:10:53.2493218Z U c10::IValue::isIntList() const 2025-05-07T20:10:53.2493550Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:53.2493865Z U c10::IValue::isTensorList() const 2025-05-07T20:10:53.2494268Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:53.2494804Z U c10::IntType::get() 2025-05-07T20:10:53.2495161Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:53.2495630Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:53.2495984Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:53.2496353Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:53.2496799Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:53.2497275Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:53.2510003Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:53.2510913Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:53.2511476Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.2511845Z U c10::StringType::get() 2025-05-07T20:10:53.2512196Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:53.2512664Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:53.2513502Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:53.2514329Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:53.2514718Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:53.2515065Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:53.2515413Z U c10::SymIntType::get() 2025-05-07T20:10:53.2515769Z U c10::SymbolicShapeMeta::init_is_contiguous() const 
2025-05-07T20:10:53.2516166Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:53.2516583Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.2516945Z U c10::TensorType::get() 2025-05-07T20:10:53.2517277Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:53.2518226Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:53.2519181Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:53.2519537Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:53.2519922Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:53.2520252Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:53.2520583Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:53.2520929Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:53.2521400Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:53.2521864Z U c10::cuda::device_count() 2025-05-07T20:10:53.2522217Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:53.2522593Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:53.2523021Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:53.2523416Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:53.2523828Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:53.2524254Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:53.2524909Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:53.2525975Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:53.2526861Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:53.2527715Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.2528868Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:53.2529912Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.2530803Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:53.2531147Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:53.2531683Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:53.2532314Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:53.2532776Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:53.2533200Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:53.2533608Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:53.2533947Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:53.2534340Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:53.2534991Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:53.2535596Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:53.2535974Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:53.2536361Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:53.2536784Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:53.2537205Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:53.2537585Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:10:53.2537949Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:53.2538293Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:53.2538656Z U c10::throwNullDataPtrError() 2025-05-07T20:10:53.2538980Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:53.2539370Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:53.2539797Z U 
caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:53.2540342Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:53.2540705Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:53.2541191Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:53.2541730Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:53.2542079Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:53.2543748Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:53.2544112Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:53.2544434Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:53.2544791Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:53.2545182Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:53.2545563Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:53.2545920Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:53.2546269Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:53.2546597Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:53.2546934Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:53.2547288Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:53.2547631Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:53.2548625Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:53.2549822Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:10:53.2550396Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:53.2550816Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:53.2551271Z U float at::Tensor::item() const 2025-05-07T20:10:53.2551635Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.2552026Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.2552384Z U free@GLIBC_2.2.5 2025-05-07T20:10:53.2552704Z U int* at::TensorBase::data_ptr() const 
2025-05-07T20:10:53.2553064Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.2553496Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:53.2553900Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.2554724Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.2555105Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:53.2555388Z U memcpy@GLIBC_2.14 2025-05-07T20:10:53.2555686Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:53.2555975Z U memset@GLIBC_2.2.5 2025-05-07T20:10:53.2556284Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:53.2556626Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:53.2557194Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2557932Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2558672Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2559436Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2560225Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2560987Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.2561535Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:53.2562174Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:10:53.2563186Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:10:53.2563804Z U sqrt@GLIBC_2.2.5 2025-05-07T20:10:53.2564100Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:53.2564538Z 
U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:53.2565207Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:53.2566061Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:53.2566999Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:53.2567797Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:53.2568395Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:53.2568720Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:53.2569078Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:53.2569456Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.2569852Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.2570296Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:53.2570704Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:53.2571081Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:53.2571551Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:53.2572465Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.2573269Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:53.2573617Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:53.2573970Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:53.2574325Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:53.2574764Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:53.2575155Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.2575652Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.2576109Z U 
std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:53.2576483Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:53.2576886Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:53.2577521Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:53.2578147Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:53.2578522Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:53.2578813Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:53.2579102Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:53.2579414Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:53.2580172Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:53.2581286Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.2582052Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.2582782Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:53.2583336Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:53.2583909Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:53.2584414Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:53.2584902Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:53.2585600Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:53.2586394Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:53.2586848Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:53.2587348Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:53.2587764Z U torch::autograd::Node::assign_parent() 
2025-05-07T20:10:53.2588172Z U torch::autograd::Node::metadata() 2025-05-07T20:10:53.2588550Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:53.2589080Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:53.2589726Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:53.2590251Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:53.2590719Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:53.2591285Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:53.2594439Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:53.2597377Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:53.2597798Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:53.2598242Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:53.2598693Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:53.2599362Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:53.2600288Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:53.2601320Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:53.2602070Z U typeinfo for c10::Error 2025-05-07T20:10:53.2602426Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:53.2602828Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:53.2603215Z U typeinfo for 
std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:53.2603576Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:53.2603942Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:53.2605261Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:53.2607496Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:53.2608824Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:53.2609254Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:53.2609678Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:53.2610112Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:53.2610488Z U vtable for c10::Error 2025-05-07T20:10:53.2611043Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.2611623Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:53.2612082Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:53.2612538Z U vtable for torch::autograd::Node 2025-05-07T20:10:53.2612945Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:53.2613342Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:53.2613675Z w _ITM_registerTMCloneTable 2025-05-07T20:10:53.2613979Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:53.2614284Z w __gmon_start__ 2025-05-07T20:10:53.2614554Z w __pthread_key_create 2025-05-07T20:10:53.2614861Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:53.2615200Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:53.2615559Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:53.2616056Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 
2025-05-07T20:10:53.2616520Z 2025-05-07T20:10:53.2616702Z linux-vdso.so.1 (0x00007ffdb338e000) 2025-05-07T20:10:53.2616985Z libc10.so => not found 2025-05-07T20:10:53.2617207Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2617468Z libc10_cuda.so => not found 2025-05-07T20:10:53.2617724Z libnccl.so.2 => not found 2025-05-07T20:10:53.2617959Z libcuda.so.1 => not found 2025-05-07T20:10:53.2618559Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f1423e00000) 2025-05-07T20:10:53.2619738Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f1423a00000) 2025-05-07T20:10:53.2620902Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f1423859000) 2025-05-07T20:10:53.2621650Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2621910Z libtorch.so => not found 2025-05-07T20:10:53.2622406Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f1423200000) 2025-05-07T20:10:53.2623350Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f1422000000) 2025-05-07T20:10:53.2624007Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2624271Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2624544Z libcudart.so.12 => not found 2025-05-07T20:10:53.2624860Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1421d9c000) 2025-05-07T20:10:53.2625276Z libm.so.6 => /lib64/libm.so.6 (0x00007f1425525000) 2025-05-07T20:10:53.2625645Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f145f30a000) 2025-05-07T20:10:53.2626018Z libc.so.6 => /lib64/libc.so.6 (0x00007f1421b94000) 2025-05-07T20:10:53.2626368Z /lib64/ld-linux-x86-64.so.2 (0x00007f145f342000) 2025-05-07T20:10:53.2626680Z libtorch.so => not found 
2025-05-07T20:10:53.2626925Z libc10.so => not found 2025-05-07T20:10:53.2627148Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2627399Z libc10_cuda.so => not found 2025-05-07T20:10:53.2627647Z libnccl.so.2 => not found 2025-05-07T20:10:53.2627889Z libcuda.so.1 => not found 2025-05-07T20:10:53.2628127Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2628591Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2628842Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2629094Z libcudart.so.12 => not found 2025-05-07T20:10:53.2629401Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f14254cf000) 2025-05-07T20:10:53.2629729Z libc10.so => not found 2025-05-07T20:10:53.2629962Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2630207Z libc10_cuda.so => not found 2025-05-07T20:10:53.2630519Z libnccl.so.2 => not found 2025-05-07T20:10:53.2630752Z libcuda.so.1 => not found 2025-05-07T20:10:53.2631357Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f1423df5000) 2025-05-07T20:10:53.2631998Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2632264Z libtorch.so => not found 2025-05-07T20:10:53.2632496Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2632751Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2633007Z libcudart.so.12 => not found 2025-05-07T20:10:53.2633250Z libc10.so => not found 2025-05-07T20:10:53.2633488Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2633726Z libc10_cuda.so => not found 2025-05-07T20:10:53.2633978Z libnccl.so.2 => not found 2025-05-07T20:10:53.2634293Z libcuda.so.1 => not found 2025-05-07T20:10:53.2634543Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2634802Z libtorch.so => not found 2025-05-07T20:10:53.2635050Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2635304Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2635566Z libcudart.so.12 => not found 2025-05-07T20:10:53.2635817Z libc10.so => not found 2025-05-07T20:10:53.2636043Z libnvrtc.so.12 => not 
found 2025-05-07T20:10:53.2636296Z libc10_cuda.so => not found 2025-05-07T20:10:53.2636545Z libnccl.so.2 => not found 2025-05-07T20:10:53.2636787Z libcuda.so.1 => not found 2025-05-07T20:10:53.2637287Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f1423d7a000) 2025-05-07T20:10:53.2637855Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2638109Z libtorch.so => not found 2025-05-07T20:10:53.2638353Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2638618Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2638864Z libtorch.so => not found 2025-05-07T20:10:53.2639098Z libc10.so => not found 2025-05-07T20:10:53.2639379Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2639631Z libc10_cuda.so => not found 2025-05-07T20:10:53.2639872Z libnccl.so.2 => not found 2025-05-07T20:10:53.2640121Z libcuda.so.1 => not found 2025-05-07T20:10:53.2640365Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2640631Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2640884Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2641143Z libcudart.so.12 => not found 2025-05-07T20:10:53.2641388Z libtorch.so => not found 2025-05-07T20:10:53.2641621Z libc10.so => not found 2025-05-07T20:10:53.2641851Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2642093Z libc10_cuda.so => not found 2025-05-07T20:10:53.2642382Z libnccl.so.2 => not found 2025-05-07T20:10:53.2642619Z libcuda.so.1 => not found 2025-05-07T20:10:53.2642865Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2643119Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2643376Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2643739Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f1423d6d000) 2025-05-07T20:10:53.2644112Z libtorch.so => not found 2025-05-07T20:10:53.2644344Z libc10.so => not found 2025-05-07T20:10:53.2644584Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.2644842Z libc10_cuda.so => not found 2025-05-07T20:10:53.2645087Z libnccl.so.2 => not found 
2025-05-07T20:10:53.2645330Z libcuda.so.1 => not found 2025-05-07T20:10:53.2645576Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.2645843Z libtorch_cpu.so => not found 2025-05-07T20:10:53.2646096Z libtorch_cuda.so => not found 2025-05-07T20:10:53.2646392Z librt.so.1 => /lib64/librt.so.1 (0x00007f1423d64000) 2025-05-07T20:10:53.2646627Z 2025-05-07T20:10:53.2646850Z [CHECK] Displaying ELF information: 2025-05-07T20:10:53.2647301Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:53.2647665Z 2025-05-07T20:10:53.2647670Z 2025-05-07T20:10:53.2647829Z Dynamic section at offset 0x38775ba0 contains 45 entries: 2025-05-07T20:10:53.2648187Z Tag Type Name/Value 2025-05-07T20:10:53.2648588Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:53.2649095Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:53.2649585Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:53.2650071Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:53.2650543Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:53.2651044Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:53.2651569Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:53.2652129Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:53.2652674Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:53.2653174Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:53.2653769Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:53.2654234Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:53.2654728Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:53.2655194Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cuda.so] 2025-05-07T20:10:53.2655668Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:53.2656137Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:53.2656595Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:53.2657045Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:53.2657494Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:53.2657991Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:53.2658763Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:53.2659297Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:53.2659680Z 0x000000000000000c (INIT) 0x652000 2025-05-07T20:10:53.2660033Z 0x000000000000000d (FINI) 0x2f6443c 2025-05-07T20:10:53.2660394Z 0x0000000000000019 (INIT_ARRAY) 0x3871d880 2025-05-07T20:10:53.2660752Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:10:53.2661152Z 0x000000000000001a (FINI_ARRAY) 0x3871dfa0 2025-05-07T20:10:53.2661492Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:53.2661841Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:53.2662166Z 0x0000000000000005 (STRTAB) 0x62978 2025-05-07T20:10:53.2662501Z 0x0000000000000006 (SYMTAB) 0x18470 2025-05-07T20:10:53.2662899Z 0x000000000000000a (STRSZ) 5120077 (bytes) 2025-05-07T20:10:53.2663257Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:53.2663613Z 0x0000000000000003 (PLTGOT) 0x38788fe8 2025-05-07T20:10:53.2663968Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:10:53.2664317Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:53.2664636Z 0x0000000000000017 (JMPREL) 0x641978 2025-05-07T20:10:53.2664971Z 0x0000000000000007 (RELA) 0x54ae50 2025-05-07T20:10:53.2665316Z 0x0000000000000008 (RELASZ) 1010472 (bytes) 2025-05-07T20:10:53.2665685Z 0x0000000000000009 (RELAENT) 24 (bytes) 
2025-05-07T20:10:53.2666040Z 0x000000006ffffffe (VERNEED) 0x54ace0 2025-05-07T20:10:53.2666362Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:53.2666703Z 0x000000006ffffff0 (VERSYM) 0x5449c6 2025-05-07T20:10:53.2667032Z 0x000000006ffffff9 (RELACOUNT) 28262 2025-05-07T20:10:53.2667367Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:53.2667568Z 2025-05-07T20:10:53.2667741Z ################################################################################ 2025-05-07T20:10:53.2667978Z 2025-05-07T20:10:53.2667983Z 2025-05-07T20:10:53.2668094Z ################################################################################ 2025-05-07T20:10:53.2668640Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.2669159Z [CHECK] Listing out library size: 2025-05-07T20:10:53.2669667Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.2670186Z 2025-05-07T20:10:53.2670418Z 142 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.2670767Z 2025-05-07T20:10:53.2671169Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.2672178Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.2672765Z 2025-05-07T20:10:53.2880531Z GLIBC_2.2.5 2025-05-07T20:10:53.2881179Z GLIBC_2.3 2025-05-07T20:10:53.2881717Z GLIBC_2.14 2025-05-07T20:10:53.2882092Z 2025-05-07T20:10:53.2882106Z 2025-05-07T20:10:53.2883484Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.2885807Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep 
GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.2886420Z 2025-05-07T20:10:53.3154537Z GLIBCXX_3.4 2025-05-07T20:10:53.3155200Z GLIBCXX_3.4.9 2025-05-07T20:10:53.3155837Z GLIBCXX_3.4.11 2025-05-07T20:10:53.3156750Z GLIBCXX_3.4.18 2025-05-07T20:10:53.3157345Z GLIBCXX_3.4.20 2025-05-07T20:10:53.3157900Z GLIBCXX_3.4.21 2025-05-07T20:10:53.3158246Z 2025-05-07T20:10:53.3158270Z 2025-05-07T20:10:53.3174768Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.w4Sak4t6No.symbols.txt 2025-05-07T20:10:53.3175328Z 2025-05-07T20:10:53.3415158Z 2025-05-07T20:10:53.3441267Z [CHECK] Total Number of symbols: 1629 2025-05-07T20:10:53.3463879Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:10:53.3480209Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.ghEB3WkAgk.usymbols.txt 2025-05-07T20:10:53.3481822Z 2025-05-07T20:10:53.3504797Z 2025-05-07T20:10:53.3533834Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:53.3551104Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.3552197Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.3552778Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:53.3553130Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.3553592Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.3553972Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.3554511Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:53.3554910Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:53.3555317Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:53.3555701Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.3556051Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:53.3556373Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:53.3556687Z U 
__cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:53.3557013Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:53.3557328Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:53.3557787Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:53.3558122Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:53.3558451Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:53.3558864Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:53.3559288Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:53.3559748Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:53.3560228Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:53.3561187Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.3562506Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.3563447Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:53.3564048Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:53.3564936Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.3566081Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.3566912Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:53.3567404Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:53.3567745Z U at::globalContext() 2025-05-07T20:10:53.3568159Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.3568581Z U c10::BoolType::get() 2025-05-07T20:10:53.3568928Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:53.3569294Z U c10::FloatType::get() 2025-05-07T20:10:53.3569599Z U 
c10::GeneratorImpl::device() const 2025-05-07T20:10:53.3570033Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.3570446Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:53.3570798Z U c10::IntType::get() 2025-05-07T20:10:53.3571163Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:53.3571585Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:53.3571976Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.3572382Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:53.3572776Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:53.3573430Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:53.3574067Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:53.3574432Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:53.3574746Z U c10::SymIntType::get() 2025-05-07T20:10:53.3575105Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:53.3575529Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.3575900Z U c10::TensorType::get() 2025-05-07T20:10:53.3576232Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:53.3577180Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:53.3578139Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:53.3578507Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:53.3578841Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:53.3579190Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:53.3579519Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:53.3579866Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:53.3580429Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:53.3580874Z U c10::cuda::device_count() 2025-05-07T20:10:53.3581205Z U 
c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:53.3581553Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:53.3581926Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:53.3582280Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:53.3582661Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:53.3583027Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:53.3583703Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:53.3584521Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:53.3585354Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.3586225Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:53.3587184Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.3587940Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:53.3588273Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:53.3588618Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:53.3589017Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:53.3589396Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:53.3589762Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:53.3590139Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:53.3590483Z U c10::throwNullDataPtrError() 2025-05-07T20:10:53.3590787Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:53.3591100Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:53.3591482Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:53.3591887Z U caffe2::TypeMeta::typeMetaDatas() 
2025-05-07T20:10:53.3592216Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:53.3592575Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:53.3592935Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:53.3593270Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:53.3593604Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:53.3593927Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:53.3594342Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:53.3594889Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:53.3595307Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:53.3595695Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:53.3596061Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:53.3596418Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:53.3596750Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:53.3597105Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:53.3597454Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:53.3597820Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:53.3600274Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:53.3602782Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:53.3603213Z U float at::Tensor::item() const 2025-05-07T20:10:53.3603590Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.3604046Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.3604437Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.3604866Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.3605282Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:53.3605719Z U long* 
at::TensorBase::data_ptr() const 2025-05-07T20:10:53.3606121Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.3606471Z U memcpy@GLIBC_2.14 2025-05-07T20:10:53.3606773Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:53.3607174Z U memset@GLIBC_2.2.5 2025-05-07T20:10:53.3607663Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:53.3608052Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:53.3608626Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.3609432Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.3610192Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.3610973Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.3611762Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:53.3612621Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:53.3613460Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:53.3614268Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:53.3614880Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:53.3615232Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:53.3615622Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.3616027Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.3616441Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:53.3616871Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:53.3617369Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 
2025-05-07T20:10:53.3618294Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.3619107Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:53.3619471Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:53.3619824Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:53.3620175Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:53.3620580Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.3621128Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.3621594Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:53.3621945Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:53.3622276Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:53.3622590Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:53.3623424Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:53.3624621Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.3625571Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.3626305Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:53.3627332Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:53.3630030Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3633038Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3636057Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3638921Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3641824Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3644657Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.3648189Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3652701Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3656756Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3660701Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3664609Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3668493Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.3673696Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:53.3675754Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:53.3676164Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:53.3676586Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:53.3677180Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.3677834Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:53.3678280Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:53.3678601Z w _ITM_registerTMCloneTable 2025-05-07T20:10:53.3678912Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:53.3679197Z w __gmon_start__ 2025-05-07T20:10:53.3679475Z w __pthread_key_create 2025-05-07T20:10:53.3679819Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:53.3680141Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:53.3680505Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:53.3680995Z + ldd 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.3681367Z 2025-05-07T20:10:53.3681523Z linux-vdso.so.1 (0x00007ffc019b7000) 2025-05-07T20:10:53.3681815Z libc10.so => not found 2025-05-07T20:10:53.3682052Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3682319Z libc10_cuda.so => not found 2025-05-07T20:10:53.3682579Z libnccl.so.2 => not found 2025-05-07T20:10:53.3682826Z libcuda.so.1 => not found 2025-05-07T20:10:53.3683587Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fdbb6a00000) 2025-05-07T20:10:53.3684355Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3684628Z libtorch.so => not found 2025-05-07T20:10:53.3684904Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3685165Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3685418Z libcudart.so.12 => not found 2025-05-07T20:10:53.3685742Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fdbb679c000) 2025-05-07T20:10:53.3686156Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fdbf98e0000) 2025-05-07T20:10:53.3686521Z libc.so.6 => /lib64/libc.so.6 (0x00007fdbb6594000) 2025-05-07T20:10:53.3686875Z /lib64/ld-linux-x86-64.so.2 (0x00007fdbf9916000) 2025-05-07T20:10:53.3687187Z libc10.so => not found 2025-05-07T20:10:53.3687424Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3687671Z libc10_cuda.so => not found 2025-05-07T20:10:53.3688068Z libnccl.so.2 => not found 2025-05-07T20:10:53.3688300Z libcuda.so.1 => not found 2025-05-07T20:10:53.3688989Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fdbb4e00000) 2025-05-07T20:10:53.3689948Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fdbb4a00000) 2025-05-07T20:10:53.3690997Z fbgemm_gpu_sparse_async_cumsum.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fdbb4859000) 2025-05-07T20:10:53.3691682Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3691920Z libtorch.so => not found 2025-05-07T20:10:53.3692390Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007fdbb4200000) 2025-05-07T20:10:53.3693255Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fdbb3000000) 2025-05-07T20:10:53.3693867Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3694126Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3694546Z libcudart.so.12 => not found 2025-05-07T20:10:53.3694851Z libm.so.6 => /lib64/libm.so.6 (0x00007fdbf97ff000) 2025-05-07T20:10:53.3695170Z libtorch.so => not found 2025-05-07T20:10:53.3695448Z libc10.so => not found 2025-05-07T20:10:53.3695681Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3696110Z libc10_cuda.so => not found 2025-05-07T20:10:53.3696374Z libnccl.so.2 => not found 2025-05-07T20:10:53.3696618Z libcuda.so.1 => not found 2025-05-07T20:10:53.3696879Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3697140Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3697396Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3697646Z libcudart.so.12 => not found 2025-05-07T20:10:53.3697966Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fdbf07aa000) 2025-05-07T20:10:53.3698311Z libc10.so => not found 2025-05-07T20:10:53.3698555Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3698796Z libc10_cuda.so => not found 2025-05-07T20:10:53.3699044Z libnccl.so.2 => not found 2025-05-07T20:10:53.3699295Z libcuda.so.1 => not found 2025-05-07T20:10:53.3699893Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007fdbf97ee000) 2025-05-07T20:10:53.3700681Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3700935Z 
libtorch.so => not found 2025-05-07T20:10:53.3701212Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3701465Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3701732Z libcudart.so.12 => not found 2025-05-07T20:10:53.3701982Z libc10.so => not found 2025-05-07T20:10:53.3702220Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3702488Z libc10_cuda.so => not found 2025-05-07T20:10:53.3702735Z libnccl.so.2 => not found 2025-05-07T20:10:53.3702991Z libcuda.so.1 => not found 2025-05-07T20:10:53.3703238Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3703544Z libtorch.so => not found 2025-05-07T20:10:53.3703780Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3704052Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3704312Z libcudart.so.12 => not found 2025-05-07T20:10:53.3704566Z libc10.so => not found 2025-05-07T20:10:53.3704796Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3705089Z libc10_cuda.so => not found 2025-05-07T20:10:53.3705341Z libnccl.so.2 => not found 2025-05-07T20:10:53.3705589Z libcuda.so.1 => not found 2025-05-07T20:10:53.3706113Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007fdbf072d000) 2025-05-07T20:10:53.3706666Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3707050Z libtorch.so => not found 2025-05-07T20:10:53.3707290Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3707547Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3707789Z libtorch.so => not found 2025-05-07T20:10:53.3708035Z libc10.so => not found 2025-05-07T20:10:53.3708250Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3708497Z libc10_cuda.so => not found 2025-05-07T20:10:53.3708746Z libnccl.so.2 => not found 2025-05-07T20:10:53.3708970Z libcuda.so.1 => not found 2025-05-07T20:10:53.3709209Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3709460Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3709715Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3709956Z libcudart.so.12 => not found 
2025-05-07T20:10:53.3710234Z libtorch.so => not found 2025-05-07T20:10:53.3710452Z libc10.so => not found 2025-05-07T20:10:53.3710688Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3710918Z libc10_cuda.so => not found 2025-05-07T20:10:53.3711155Z libnccl.so.2 => not found 2025-05-07T20:10:53.3711392Z libcuda.so.1 => not found 2025-05-07T20:10:53.3711632Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3711897Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3712141Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3712473Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fdbf0720000) 2025-05-07T20:10:53.3712831Z libtorch.so => not found 2025-05-07T20:10:53.3713059Z libc10.so => not found 2025-05-07T20:10:53.3713278Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.3713523Z libc10_cuda.so => not found 2025-05-07T20:10:53.3713753Z libnccl.so.2 => not found 2025-05-07T20:10:53.3713998Z libcuda.so.1 => not found 2025-05-07T20:10:53.3714336Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.3714769Z libtorch_cpu.so => not found 2025-05-07T20:10:53.3715052Z libtorch_cuda.so => not found 2025-05-07T20:10:53.3715398Z librt.so.1 => /lib64/librt.so.1 (0x00007fdbf0717000) 2025-05-07T20:10:53.3715642Z 2025-05-07T20:10:53.3715765Z [CHECK] Displaying ELF information: 2025-05-07T20:10:53.3716249Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:53.3716661Z 2025-05-07T20:10:53.3716665Z 2025-05-07T20:10:53.3716830Z Dynamic section at offset 0x8d68cc8 contains 40 entries: 2025-05-07T20:10:53.3717227Z Tag Type Name/Value 2025-05-07T20:10:53.3717635Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:53.3718149Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:53.3718655Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:53.3719208Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:53.3719708Z 0x0000000000000001 
(NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:53.3720274Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:53.3720858Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:53.3721369Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:53.3721882Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:53.3722442Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:53.3722972Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:53.3723476Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:53.3724019Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:53.3724530Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:53.3725040Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:53.3725646Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:10:53.3726203Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:53.3726616Z 0x000000000000000c (INIT) 0xbe000 2025-05-07T20:10:53.3726943Z 0x000000000000000d (FINI) 0x5f04ec 2025-05-07T20:10:53.3727287Z 0x0000000000000019 (INIT_ARRAY) 0x8d5ea18 2025-05-07T20:10:53.3727649Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:10:53.3727996Z 0x000000000000001a (FINI_ARRAY) 0x8d5eae0 2025-05-07T20:10:53.3728509Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:53.3728847Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:53.3729320Z 0x0000000000000005 (STRTAB) 0xc600 2025-05-07T20:10:53.3729685Z 0x0000000000000006 (SYMTAB) 0x2d30 2025-05-07T20:10:53.3730108Z 0x000000000000000a (STRSZ) 597451 (bytes) 2025-05-07T20:10:53.3730475Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:53.3730818Z 0x0000000000000003 (PLTGOT) 0x8d6afe8 
2025-05-07T20:10:53.3731190Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:10:53.3731534Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:53.3731870Z 0x0000000000000017 (JMPREL) 0xbab38 2025-05-07T20:10:53.3732200Z 0x0000000000000007 (RELA) 0x9f1a8 2025-05-07T20:10:53.3732560Z 0x0000000000000008 (RELASZ) 113040 (bytes) 2025-05-07T20:10:53.3732921Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:53.3733277Z 0x000000006ffffffe (VERNEED) 0x9f088 2025-05-07T20:10:53.3733609Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:53.3733923Z 0x000000006ffffff0 (VERSYM) 0x9e3cc 2025-05-07T20:10:53.3734265Z 0x000000006ffffff9 (RELACOUNT) 3303 2025-05-07T20:10:53.3734567Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:53.3734784Z 2025-05-07T20:10:53.3734890Z ################################################################################ 2025-05-07T20:10:53.3735114Z 2025-05-07T20:10:53.3735118Z 2025-05-07T20:10:53.3735230Z ################################################################################ 2025-05-07T20:10:53.3735788Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.3736339Z [CHECK] Listing out library size: 2025-05-07T20:10:53.3736837Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.3737282Z 2025-05-07T20:10:53.3737528Z 59 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.3737938Z 2025-05-07T20:10:53.3738382Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.3739488Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.3740152Z 2025-05-07T20:10:53.3831038Z GLIBC_2.2.5 
2025-05-07T20:10:53.3831630Z GLIBC_2.3 2025-05-07T20:10:53.3832201Z GLIBC_2.14 2025-05-07T20:10:53.3832551Z 2025-05-07T20:10:53.3832555Z 2025-05-07T20:10:53.3833205Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.3834464Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.3835152Z 2025-05-07T20:10:53.3998165Z GLIBCXX_3.4 2025-05-07T20:10:53.3998779Z GLIBCXX_3.4.9 2025-05-07T20:10:53.3999409Z GLIBCXX_3.4.11 2025-05-07T20:10:53.4000013Z GLIBCXX_3.4.15 2025-05-07T20:10:53.4000564Z GLIBCXX_3.4.18 2025-05-07T20:10:53.4001129Z GLIBCXX_3.4.20 2025-05-07T20:10:53.4001678Z GLIBCXX_3.4.21 2025-05-07T20:10:53.4002034Z 2025-05-07T20:10:53.4002073Z 2025-05-07T20:10:53.4021919Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.gH5tIVRKz9.symbols.txt 2025-05-07T20:10:53.4022880Z 2025-05-07T20:10:53.4141180Z 2025-05-07T20:10:53.4165960Z [CHECK] Total Number of symbols: 1874 2025-05-07T20:10:53.4184822Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:10:53.4200413Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.Wo85cwsPTl.usymbols.txt 2025-05-07T20:10:53.4202033Z 2025-05-07T20:10:53.4223964Z 2025-05-07T20:10:53.4255889Z [CHECK] Listing out undefined symbols (259 total): 2025-05-07T20:10:53.4272749Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.4273910Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.4274719Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:53.4275135Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.4275542Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.4275923Z U 
__cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.4276325Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:53.4276708Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:53.4277059Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:53.4277432Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.4277799Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:53.4278139Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:53.4278444Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:53.4278763Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:53.4279099Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:53.4279422Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:53.4279755Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:53.4280062Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:53.4280386Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:53.4280697Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:53.4281022Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:53.4281327Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:53.4281770Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:53.4282086Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:53.4282550Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:53.4282984Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:53.4283388Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:53.4283734Z U at::RecordFunction::end() 2025-05-07T20:10:53.4284176Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:53.4284538Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:53.4284964Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:53.4285452Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:53.4286262Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.4287553Z U 
at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.4288421Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:53.4289133Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.4290226Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.4290991Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:53.4291352Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:53.4291719Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:53.4292089Z U at::globalContext() 2025-05-07T20:10:53.4292432Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:53.4292731Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:53.4293015Z U c10::AnyType::get() 2025-05-07T20:10:53.4293382Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.4293779Z U c10::BoolType::get() 2025-05-07T20:10:53.4294109Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:53.4294541Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:53.4294933Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:53.4295817Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:53.4297055Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:53.4298145Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:53.4298716Z U c10::Error::what() const 2025-05-07T20:10:53.4299025Z U c10::FloatType::get() 2025-05-07T20:10:53.4299322Z U c10::GradMode::is_enabled() 2025-05-07T20:10:53.4299644Z U c10::GradMode::set_enabled(bool) 
2025-05-07T20:10:53.4300036Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.4300468Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:53.4300853Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:53.4301324Z U c10::IValue::isBoolList() const 2025-05-07T20:10:53.4301644Z U c10::IValue::isIntList() const 2025-05-07T20:10:53.4301951Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:53.4302282Z U c10::IValue::isTensorList() const 2025-05-07T20:10:53.4302633Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:53.4302965Z U c10::IntType::get() 2025-05-07T20:10:53.4303320Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:53.4303684Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:53.4304048Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:53.4304374Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:53.4304800Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:53.4305405Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:53.4305911Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.4306291Z U c10::StringType::get() 2025-05-07T20:10:53.4306615Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:53.4306998Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:53.4307402Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:53.4307799Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:53.4308199Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:53.4308814Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:53.4309442Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:53.4309820Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:53.4310170Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:53.4310557Z 
U c10::SymInt::promote_to_negative() 2025-05-07T20:10:53.4310895Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:53.4311263Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:53.4311624Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:53.4311948Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:53.4312264Z U c10::SymIntType::get() 2025-05-07T20:10:53.4312602Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:53.4312981Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:53.4313339Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.4313696Z U c10::TensorType::get() 2025-05-07T20:10:53.4314030Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:53.4315270Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:53.4316263Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:53.4316623Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:53.4316983Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:53.4317344Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:53.4317687Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:53.4318047Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:53.4318507Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:53.4319025Z U c10::cuda::device_count() 2025-05-07T20:10:53.4319388Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:53.4319767Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:53.4320164Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:53.4320558Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:53.4320971Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:53.4321355Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:53.4322049Z U 
c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:53.4323113Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:53.4324076Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:53.4324946Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.4325902Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:53.4327046Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.4328044Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:53.4328681Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:53.4329224Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:53.4329866Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:53.4330343Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:53.4330865Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:53.4331278Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:53.4331620Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:53.4332024Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:53.4332745Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:53.4333367Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:53.4333749Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:53.4334155Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:53.4334566Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:53.4335011Z U c10::operator<<(std::ostream&, 
c10::SymFloat const&) 2025-05-07T20:10:53.4335428Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:53.4335789Z U c10::throwNullDataPtrError() 2025-05-07T20:10:53.4336128Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:53.4336452Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:53.4336877Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:53.4337303Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:53.4337677Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:53.4338059Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:53.4338432Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:53.4338809Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:53.4339207Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:53.4339571Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:53.4339908Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:53.4340268Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:53.4340623Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:53.4341011Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:53.4341392Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:53.4341736Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:53.4342133Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:53.4342467Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:53.4342833Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:53.4343189Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:53.4345764Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:53.4348286Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 
2025-05-07T20:10:53.4348782Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.4349164Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.4349525Z U free@GLIBC_2.2.5 2025-05-07T20:10:53.4349853Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.4350212Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.4350660Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:53.4351055Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.4351427Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.4351759Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:53.4352034Z U memcpy@GLIBC_2.14 2025-05-07T20:10:53.4352323Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:53.4364965Z U memset@GLIBC_2.2.5 2025-05-07T20:10:53.4365339Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:53.4365693Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:53.4366285Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.4367051Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.4367601Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:53.4368020Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:53.4368676Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:53.4369505Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:53.4370331Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:53.4371139Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:53.4372139Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:53.4372732Z U 
std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:53.4373077Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:53.4373445Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.4373835Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.4374260Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:53.4374714Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:53.4375103Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:53.4375604Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:53.4376560Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.4377380Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:53.4377751Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:53.4378095Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:53.4378446Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:53.4378788Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:53.4379193Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.4379721Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.4380214Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:53.4380627Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:53.4381036Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:53.4381748Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:53.4382424Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:53.4382954Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:53.4383054Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:53.4383153Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:53.4383278Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:53.4383871Z U 
torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:53.4384345Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.4384607Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.4384734Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:53.4385036Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:53.4385217Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:53.4385428Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:53.4385617Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:53.4385960Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:53.4386149Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:53.4386347Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:53.4386521Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:53.4386642Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:53.4386769Z U torch::autograd::Node::metadata() 2025-05-07T20:10:53.4386901Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:53.4387147Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:53.4387456Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:53.4387595Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:53.4387804Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:53.4388066Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:53.4390747Z U torch::autograd::_wrap_outputs(std::vector > 
const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:53.4390918Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:53.4391066Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:53.4391269Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:53.4392055Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:53.4392209Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:53.4392633Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:53.4392987Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:53.4393542Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:53.4393662Z U typeinfo for c10::Error 2025-05-07T20:10:53.4393796Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:53.4393918Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:53.4394059Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:53.4394319Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:53.4394436Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:53.4395890Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4397373Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4398809Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4400231Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4401591Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4402961Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:53.4403141Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:53.4403300Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:53.4403466Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:53.4403618Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:53.4403715Z U vtable for c10::Error 2025-05-07T20:10:53.4404058Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.4404189Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:53.4404416Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:53.4404547Z U vtable for torch::autograd::Node 2025-05-07T20:10:53.4404724Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:53.4404833Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:53.4404955Z w _ITM_registerTMCloneTable 2025-05-07T20:10:53.4405053Z w __cxa_finalize@GLIBC_2.2.5 
2025-05-07T20:10:53.4405143Z w __gmon_start__ 2025-05-07T20:10:53.4405238Z w __pthread_key_create 2025-05-07T20:10:53.4405365Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:53.4405472Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:53.4405613Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:53.4405893Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.4405902Z 2025-05-07T20:10:53.4406036Z linux-vdso.so.1 (0x00007ffcf7771000) 2025-05-07T20:10:53.4406125Z libc10.so => not found 2025-05-07T20:10:53.4406238Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4406328Z libc10_cuda.so => not found 2025-05-07T20:10:53.4406418Z libnccl.so.2 => not found 2025-05-07T20:10:53.4406511Z libcuda.so.1 => not found 2025-05-07T20:10:53.4407088Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f7897800000) 2025-05-07T20:10:53.4407185Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4407280Z libtorch.so => not found 2025-05-07T20:10:53.4407387Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4407507Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4407600Z libcudart.so.12 => not found 2025-05-07T20:10:53.4407771Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f789759c000) 2025-05-07T20:10:53.4407916Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f78d534c000) 2025-05-07T20:10:53.4408089Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f78d531e000) 2025-05-07T20:10:53.4408208Z libc.so.6 => /lib64/libc.so.6 (0x00007f7897394000) 2025-05-07T20:10:53.4408351Z /lib64/ld-linux-x86-64.so.2 (0x00007f78d53aa000) 2025-05-07T20:10:53.4408435Z libc10.so => not found 2025-05-07T20:10:53.4408527Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4408627Z libc10_cuda.so => not found 2025-05-07T20:10:53.4408716Z libnccl.so.2 => not found 2025-05-07T20:10:53.4408803Z libcuda.so.1 => not found 
2025-05-07T20:10:53.4409263Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f7895c00000) 2025-05-07T20:10:53.4409734Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f7895800000) 2025-05-07T20:10:53.4410269Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f7895659000) 2025-05-07T20:10:53.4410373Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4410479Z libtorch.so => not found 2025-05-07T20:10:53.4410854Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f7895000000) 2025-05-07T20:10:53.4411306Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f7893e00000) 2025-05-07T20:10:53.4411414Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4411510Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4411602Z libcudart.so.12 => not found 2025-05-07T20:10:53.4411739Z libm.so.6 => /lib64/libm.so.6 (0x00007f78d523d000) 2025-05-07T20:10:53.4411834Z libtorch.so => not found 2025-05-07T20:10:53.4411917Z libc10.so => not found 2025-05-07T20:10:53.4412006Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4412109Z libc10_cuda.so => not found 2025-05-07T20:10:53.4412201Z libnccl.so.2 => not found 2025-05-07T20:10:53.4412294Z libcuda.so.1 => not found 2025-05-07T20:10:53.4412410Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4412503Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4412596Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4412688Z libcudart.so.12 => not found 2025-05-07T20:10:53.4412787Z libc10.so => not found 2025-05-07T20:10:53.4412879Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4412969Z libc10_cuda.so => not found 2025-05-07T20:10:53.4413070Z libnccl.so.2 => 
not found 2025-05-07T20:10:53.4413158Z libcuda.so.1 => not found 2025-05-07T20:10:53.4413597Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f78d522c000) 2025-05-07T20:10:53.4413699Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4413804Z libtorch.so => not found 2025-05-07T20:10:53.4413896Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4413989Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4414093Z libcudart.so.12 => not found 2025-05-07T20:10:53.4414208Z libc10.so => not found 2025-05-07T20:10:53.4414297Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4414388Z libc10_cuda.so => not found 2025-05-07T20:10:53.4414495Z libnccl.so.2 => not found 2025-05-07T20:10:53.4414582Z libcuda.so.1 => not found 2025-05-07T20:10:53.4414677Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4414781Z libtorch.so => not found 2025-05-07T20:10:53.4414868Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4414962Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4415057Z libcudart.so.12 => not found 2025-05-07T20:10:53.4415151Z libc10.so => not found 2025-05-07T20:10:53.4415243Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4416827Z libc10_cuda.so => not found 2025-05-07T20:10:53.4416934Z libnccl.so.2 => not found 2025-05-07T20:10:53.4417020Z libcuda.so.1 => not found 2025-05-07T20:10:53.4417384Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f78d1589000) 2025-05-07T20:10:53.4417495Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4417622Z libtorch.so => not found 2025-05-07T20:10:53.4417716Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4417812Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4417913Z libtorch.so => not found 2025-05-07T20:10:53.4417996Z libc10.so => not found 2025-05-07T20:10:53.4418211Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4418310Z libc10_cuda.so => not found 2025-05-07T20:10:53.4418396Z libnccl.so.2 
=> not found 2025-05-07T20:10:53.4418484Z libcuda.so.1 => not found 2025-05-07T20:10:53.4418582Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4418683Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4418779Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4418871Z libcudart.so.12 => not found 2025-05-07T20:10:53.4418970Z libtorch.so => not found 2025-05-07T20:10:53.4419051Z libc10.so => not found 2025-05-07T20:10:53.4419142Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4419229Z libc10_cuda.so => not found 2025-05-07T20:10:53.4419327Z libnccl.so.2 => not found 2025-05-07T20:10:53.4419415Z libcuda.so.1 => not found 2025-05-07T20:10:53.4419510Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4419642Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4419732Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4419906Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f78d5217000) 2025-05-07T20:10:53.4419993Z libtorch.so => not found 2025-05-07T20:10:53.4420087Z libc10.so => not found 2025-05-07T20:10:53.4420178Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.4420267Z libc10_cuda.so => not found 2025-05-07T20:10:53.4420365Z libnccl.so.2 => not found 2025-05-07T20:10:53.4420452Z libcuda.so.1 => not found 2025-05-07T20:10:53.4420552Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.4420643Z libtorch_cpu.so => not found 2025-05-07T20:10:53.4420743Z libtorch_cuda.so => not found 2025-05-07T20:10:53.4420874Z librt.so.1 => /lib64/librt.so.1 (0x00007f78d520e000) 2025-05-07T20:10:53.4420881Z 2025-05-07T20:10:53.4420989Z [CHECK] Displaying ELF information: 2025-05-07T20:10:53.4421293Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:53.4421301Z 2025-05-07T20:10:53.4421347Z 2025-05-07T20:10:53.4421513Z Dynamic section at offset 0x3a27010 contains 41 entries: 2025-05-07T20:10:53.4421630Z Tag Type Name/Value 2025-05-07T20:10:53.4421836Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 
2025-05-07T20:10:53.4422034Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:53.4422228Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:53.4422428Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:53.4422616Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:53.4422866Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:53.4423223Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:53.4423410Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:53.4423609Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:53.4423815Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:53.4424009Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:53.4424207Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:53.4424398Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:53.4424624Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:53.4424807Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:53.4425018Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:53.4425342Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:10:53.4425525Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:53.4425636Z 0x000000000000000c (INIT) 0x80000 2025-05-07T20:10:53.4425746Z 0x000000000000000d (FINI) 0x261c5c 2025-05-07T20:10:53.4425876Z 0x0000000000000019 (INIT_ARRAY) 0x3a223b0 2025-05-07T20:10:53.4426167Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:10:53.4426288Z 0x000000000000001a (FINI_ARRAY) 0x3a22468 2025-05-07T20:10:53.4426415Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 
2025-05-07T20:10:53.4426525Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:53.4426634Z 0x0000000000000005 (STRTAB) 0xe368 2025-05-07T20:10:53.4426752Z 0x0000000000000006 (SYMTAB) 0x33a0 2025-05-07T20:10:53.4426883Z 0x000000000000000a (STRSZ) 374997 (bytes) 2025-05-07T20:10:53.4427004Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:53.4427122Z 0x0000000000000003 (PLTGOT) 0x3a28fe8 2025-05-07T20:10:53.4427285Z 0x0000000000000002 (PLTRELSZ) 18456 (bytes) 2025-05-07T20:10:53.4427389Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:53.4427502Z 0x0000000000000017 (JMPREL) 0x7b2d8 2025-05-07T20:10:53.4427617Z 0x0000000000000007 (RELA) 0x6ac28 2025-05-07T20:10:53.4427748Z 0x0000000000000008 (RELASZ) 67248 (bytes) 2025-05-07T20:10:53.4427868Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:53.4427983Z 0x000000006ffffffe (VERNEED) 0x6aae8 2025-05-07T20:10:53.4428098Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:53.4428209Z 0x000000006ffffff0 (VERSYM) 0x69c3e 2025-05-07T20:10:53.4428514Z 0x000000006ffffff9 (RELACOUNT) 1392 2025-05-07T20:10:53.4428623Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:53.4428630Z 2025-05-07T20:10:53.4428744Z ################################################################################ 2025-05-07T20:10:53.4428750Z 2025-05-07T20:10:53.4428754Z 2025-05-07T20:10:53.4428869Z ################################################################################ 2025-05-07T20:10:53.4429208Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.4429313Z [CHECK] Listing out library size: 2025-05-07T20:10:53.4429629Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.4429634Z 2025-05-07T20:10:53.4429900Z 328 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.4429904Z 2025-05-07T20:10:53.4430343Z [CHECK] Listing out the GLIBC versions 
referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.4430882Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.4430985Z 2025-05-07T20:10:53.5020631Z GLIBC_2.2.5 2025-05-07T20:10:53.5020889Z GLIBC_2.3 2025-05-07T20:10:53.5021119Z GLIBC_2.14 2025-05-07T20:10:53.5021134Z 2025-05-07T20:10:53.5021147Z 2025-05-07T20:10:53.5022583Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.5023978Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:53.5024191Z 2025-05-07T20:10:53.5663719Z GLIBCXX_3.4 2025-05-07T20:10:53.5664033Z GLIBCXX_3.4.9 2025-05-07T20:10:53.5664280Z GLIBCXX_3.4.11 2025-05-07T20:10:53.5664365Z GLIBCXX_3.4.18 2025-05-07T20:10:53.5664473Z GLIBCXX_3.4.20 2025-05-07T20:10:53.5664553Z GLIBCXX_3.4.21 2025-05-07T20:10:53.5664575Z 2025-05-07T20:10:53.5664806Z 2025-05-07T20:10:53.5684578Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.NklSjik0jO.symbols.txt 2025-05-07T20:10:53.5684621Z 2025-05-07T20:10:53.6283206Z 2025-05-07T20:10:53.6319906Z [CHECK] Total Number of symbols: 3739 2025-05-07T20:10:53.6357132Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:10:53.6375332Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.DNq42BnM8i.usymbols.txt 2025-05-07T20:10:53.6376940Z 2025-05-07T20:10:53.6404508Z 2025-05-07T20:10:53.6431504Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:10:53.6445384Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.6446277Z U VTT 
for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.6446844Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:53.6447202Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.6447820Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:53.6448193Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.6448582Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:53.6448949Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:53.6449301Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:53.6449665Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:53.6450021Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:53.6450336Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:53.6450642Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:53.6450950Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:53.6451260Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:53.6451590Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:53.6451893Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:53.6452201Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:53.6452550Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:53.6452940Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:53.6453365Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:53.6453792Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:53.6454254Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:53.6455110Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.6456519Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.6457590Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:53.6458189Z U at::_ops::to_dtype::call(at::Tensor const&, 
c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:53.6459135Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.6460254Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:53.6461043Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:53.6461463Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:53.6461790Z U at::globalContext() 2025-05-07T20:10:53.6462155Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.6462541Z U c10::BoolType::get() 2025-05-07T20:10:53.6462873Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:53.6463204Z U c10::FloatType::get() 2025-05-07T20:10:53.6463492Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:53.6463849Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.6464243Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:53.6464555Z U c10::IntType::get() 2025-05-07T20:10:53.6464894Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:53.6465270Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:53.6465611Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.6466025Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:53.6466376Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:53.6466764Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:53.6467163Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:53.6467764Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:53.6468364Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:53.6468710Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:53.6469039Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:53.6469366Z U 
c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:53.6469695Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:53.6470025Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:53.6470325Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:53.6470616Z U c10::SymIntType::get() 2025-05-07T20:10:53.6471119Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:53.6471516Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:53.6471872Z U c10::TensorType::get() 2025-05-07T20:10:53.6472208Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:53.6473126Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:53.6474240Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:53.6474775Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:53.6475126Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:53.6475471Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:53.6475802Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:53.6476146Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:53.6476604Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:53.6477109Z U c10::cuda::device_count() 2025-05-07T20:10:53.6477441Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:53.6477820Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:53.6478200Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:53.6478613Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:53.6479019Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:53.6479392Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:53.6480129Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:53.6481000Z U c10::detail::torchCheckFail(char const*, char const*, unsigned 
int, char const*) 2025-05-07T20:10:53.6481857Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.6482799Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:53.6483834Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:53.6484674Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:53.6485006Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:53.6485361Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:53.6485797Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:53.6486199Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:53.6486541Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:53.6487007Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:53.6487344Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:53.6487703Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:53.6488067Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:53.6488395Z U c10::throwNullDataPtrError() 2025-05-07T20:10:53.6488699Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:53.6488995Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:53.6489385Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:53.6489771Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:53.6490093Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:53.6490433Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:53.6490765Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:53.6491098Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:53.6491410Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:53.6491735Z U cudaEventQuery@libcudart.so.12 
2025-05-07T20:10:53.6492069Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:53.6492390Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:53.6492706Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:53.6493047Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:53.6493386Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:53.6493701Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:53.6494048Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:53.6494381Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:53.6494960Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:53.6495317Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:53.6497668Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:53.6500028Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:53.6500469Z U float at::Tensor::item() const 2025-05-07T20:10:53.6500830Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.6501261Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.6501653Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.6502052Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.6502501Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:53.6502908Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:53.6503350Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:53.6503710Z U memcpy@GLIBC_2.14 2025-05-07T20:10:53.6504215Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:53.6504512Z U memset@GLIBC_2.2.5 2025-05-07T20:10:53.6504844Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:53.6505312Z U operator new(unsigned long)@GLIBCXX_3.4 
2025-05-07T20:10:53.6505881Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6506850Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6507692Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6508498Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6509357Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6510133Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:53.6510960Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:53.6511834Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:53.6512681Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:53.6513555Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:53.6514266Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:53.6514629Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:53.6515099Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.6515493Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:53.6515934Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:53.6516385Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:53.6516887Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:53.6517848Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, 
long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.6518651Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:53.6519020Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:53.6519384Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:53.6519723Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:53.6520144Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.6520682Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:53.6521183Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:53.6521538Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:53.6521874Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:53.6522216Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:53.6523044Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:53.6524249Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.6525096Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:53.6525833Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:53.6526992Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:53.6530001Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6534103Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6538192Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6542302Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6546095Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6549729Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:53.6553227Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:53.6555619Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:53.6556049Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:53.6556506Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:53.6557127Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:53.6557796Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:53.6558270Z w _ITM_deregisterTMCloneTable 
2025-05-07T20:10:53.6558602Z w _ITM_registerTMCloneTable 2025-05-07T20:10:53.6558938Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:53.6559240Z w __gmon_start__ 2025-05-07T20:10:53.6559542Z w __pthread_key_create 2025-05-07T20:10:53.6560071Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:53.6560401Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:53.6560946Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:53.6561456Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.6561882Z 2025-05-07T20:10:53.6562051Z linux-vdso.so.1 (0x00007ffc1bd52000) 2025-05-07T20:10:53.6562374Z libc10.so => not found 2025-05-07T20:10:53.6562641Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6562902Z libc10_cuda.so => not found 2025-05-07T20:10:53.6563180Z libnccl.so.2 => not found 2025-05-07T20:10:53.6563432Z libcuda.so.1 => not found 2025-05-07T20:10:53.6564178Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fa565400000) 2025-05-07T20:10:53.6564953Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6565240Z libtorch.so => not found 2025-05-07T20:10:53.6565520Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6565807Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6566090Z libcudart.so.12 => not found 2025-05-07T20:10:53.6566418Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa56519c000) 2025-05-07T20:10:53.6566883Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fa5b41da000) 2025-05-07T20:10:53.6567294Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa5b41ac000) 2025-05-07T20:10:53.6567699Z libc.so.6 => /lib64/libc.so.6 (0x00007fa564f94000) 2025-05-07T20:10:53.6568061Z /lib64/ld-linux-x86-64.so.2 (0x00007fa5b4238000) 2025-05-07T20:10:53.6568412Z libc10.so => not found 2025-05-07T20:10:53.6568657Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6568939Z libc10_cuda.so => not found 
2025-05-07T20:10:53.6569217Z libnccl.so.2 => not found 2025-05-07T20:10:53.6569467Z libcuda.so.1 => not found 2025-05-07T20:10:53.6570220Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fa563800000) 2025-05-07T20:10:53.6571226Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fa563400000) 2025-05-07T20:10:53.6572430Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fa563259000) 2025-05-07T20:10:53.6573127Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6573418Z libtorch.so => not found 2025-05-07T20:10:53.6573913Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007fa562c00000) 2025-05-07T20:10:53.6574761Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fa561a00000) 2025-05-07T20:10:53.6575373Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6575616Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6575866Z libcudart.so.12 => not found 2025-05-07T20:10:53.6576138Z libm.so.6 => /lib64/libm.so.6 (0x00007fa59f125000) 2025-05-07T20:10:53.6576440Z libtorch.so => not found 2025-05-07T20:10:53.6576669Z libc10.so => not found 2025-05-07T20:10:53.6576883Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6577126Z libc10_cuda.so => not found 2025-05-07T20:10:53.6577360Z libnccl.so.2 => not found 2025-05-07T20:10:53.6577595Z libcuda.so.1 => not found 2025-05-07T20:10:53.6577830Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6578079Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6578323Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6578571Z libcudart.so.12 => not found 2025-05-07T20:10:53.6578801Z libc10.so => not found 2025-05-07T20:10:53.6579027Z libnvrtc.so.12 => 
not found 2025-05-07T20:10:53.6579276Z libc10_cuda.so => not found 2025-05-07T20:10:53.6579501Z libnccl.so.2 => not found 2025-05-07T20:10:53.6579734Z libcuda.so.1 => not found 2025-05-07T20:10:53.6580294Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007fa5b4195000) 2025-05-07T20:10:53.6580896Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6581144Z libtorch.so => not found 2025-05-07T20:10:53.6581383Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6581618Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6581910Z libcudart.so.12 => not found 2025-05-07T20:10:53.6582150Z libc10.so => not found 2025-05-07T20:10:53.6582367Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6582609Z libc10_cuda.so => not found 2025-05-07T20:10:53.6582841Z libnccl.so.2 => not found 2025-05-07T20:10:53.6583073Z libcuda.so.1 => not found 2025-05-07T20:10:53.6583303Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6583553Z libtorch.so => not found 2025-05-07T20:10:53.6583778Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6584028Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6584263Z libcudart.so.12 => not found 2025-05-07T20:10:53.6584494Z libc10.so => not found 2025-05-07T20:10:53.6584737Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6584975Z libc10_cuda.so => not found 2025-05-07T20:10:53.6585225Z libnccl.so.2 => not found 2025-05-07T20:10:53.6585448Z libcuda.so.1 => not found 2025-05-07T20:10:53.6585927Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007fa5b4116000) 2025-05-07T20:10:53.6586475Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6586727Z libtorch.so => not found 2025-05-07T20:10:53.6586956Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6587200Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6587432Z libtorch.so => not found 2025-05-07T20:10:53.6587654Z libc10.so => not found 2025-05-07T20:10:53.6587873Z libnvrtc.so.12 
=> not found 2025-05-07T20:10:53.6588110Z libc10_cuda.so => not found 2025-05-07T20:10:53.6588352Z libnccl.so.2 => not found 2025-05-07T20:10:53.6588583Z libcuda.so.1 => not found 2025-05-07T20:10:53.6589009Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6589261Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6589526Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6589780Z libcudart.so.12 => not found 2025-05-07T20:10:53.6590302Z libtorch.so => not found 2025-05-07T20:10:53.6590537Z libc10.so => not found 2025-05-07T20:10:53.6590776Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6591027Z libc10_cuda.so => not found 2025-05-07T20:10:53.6591282Z libnccl.so.2 => not found 2025-05-07T20:10:53.6591531Z libcuda.so.1 => not found 2025-05-07T20:10:53.6591801Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6592062Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6592313Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6592649Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fa59f120000) 2025-05-07T20:10:53.6593005Z libtorch.so => not found 2025-05-07T20:10:53.6593249Z libc10.so => not found 2025-05-07T20:10:53.6593555Z libnvrtc.so.12 => not found 2025-05-07T20:10:53.6593806Z libc10_cuda.so => not found 2025-05-07T20:10:53.6594044Z libnccl.so.2 => not found 2025-05-07T20:10:53.6594398Z libcuda.so.1 => not found 2025-05-07T20:10:53.6594830Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:53.6595094Z libtorch_cpu.so => not found 2025-05-07T20:10:53.6595441Z libtorch_cuda.so => not found 2025-05-07T20:10:53.6595731Z librt.so.1 => /lib64/librt.so.1 (0x00007fa59f119000) 2025-05-07T20:10:53.6595985Z 2025-05-07T20:10:53.6596095Z [CHECK] Displaying ELF information: 2025-05-07T20:10:53.6596569Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:53.6596975Z 2025-05-07T20:10:53.6596979Z 2025-05-07T20:10:53.6597138Z Dynamic section at offset 0x147859a8 contains 41 entries: 2025-05-07T20:10:53.6597528Z Tag 
Type Name/Value 2025-05-07T20:10:53.6597928Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:53.6598437Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:53.6598943Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:53.6599451Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:53.6599940Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:53.6600513Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:53.6601134Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:53.6601645Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:53.6602154Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:53.6602659Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:53.6603177Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:53.6603683Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:53.6604227Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:53.6604729Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:53.6605214Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:53.6605754Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:53.6606345Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:10:53.6607021Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:53.6607396Z 0x000000000000000c (INIT) 0x1dc000 2025-05-07T20:10:53.6607698Z 0x000000000000000d (FINI) 0xe754cc 2025-05-07T20:10:53.6608016Z 0x0000000000000019 (INIT_ARRAY) 0x1476a588 2025-05-07T20:10:53.6608343Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:10:53.6608667Z 0x000000000000001a 
(FINI_ARRAY) 0x1476a830 2025-05-07T20:10:53.6608981Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:53.6609292Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:53.6609579Z 0x0000000000000005 (STRTAB) 0x1c8a0 2025-05-07T20:10:53.6609889Z 0x0000000000000006 (SYMTAB) 0x6a00 2025-05-07T20:10:53.6610218Z 0x000000000000000a (STRSZ) 1486798 (bytes) 2025-05-07T20:10:53.6610541Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:53.6610893Z 0x0000000000000003 (PLTGOT) 0x1478afe8 2025-05-07T20:10:53.6611218Z 0x0000000000000002 (PLTRELSZ) 22152 (bytes) 2025-05-07T20:10:53.6611539Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:53.6611830Z 0x0000000000000017 (JMPREL) 0x1d5988 2025-05-07T20:10:53.6612143Z 0x0000000000000007 (RELA) 0x1896c8 2025-05-07T20:10:53.6612459Z 0x0000000000000008 (RELASZ) 312000 (bytes) 2025-05-07T20:10:53.6612792Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:53.6613116Z 0x000000006ffffffe (VERNEED) 0x1895a8 2025-05-07T20:10:53.6613415Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:53.6613719Z 0x000000006ffffff0 (VERSYM) 0x18786e 2025-05-07T20:10:53.6614025Z 0x000000006ffffff9 (RELACOUNT) 8035 2025-05-07T20:10:53.6614322Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:53.6614501Z 2025-05-07T20:10:53.6614601Z ################################################################################ 2025-05-07T20:10:53.6614814Z 2025-05-07T20:10:53.6614818Z 2025-05-07T20:10:53.6615009Z [CHECK] Verifying sample subset of symbols in the built libraries ... 
2025-05-07T20:10:53.6673187Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.6700119Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.6932914Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.6970037Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.7004413Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.7057586Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.7091286Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.7118249Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:53.7231345Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7256006Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7488406Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7523546Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7561171Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7619791Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7654562Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.7697590Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.8113118Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.8475971Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.8694490Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.9623486Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.9659316Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:53.9747738Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:54.0075469Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:54.0076776Z ################################################################################ 2025-05-07T20:10:54.0077306Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:10:54.0077737Z 
2025-05-07T20:10:54.0078197Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:10:54.0078756Z 2025-05-07T20:11:05.6668151Z 2025-05-07T20:11:05.6669100Z fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:05.6670606Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:05.6671435Z 2025-05-07T20:11:05.6671927Z The wheel references external versioned symbols in these 2025-05-07T20:11:05.6673198Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:11:05.6674366Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0'}, 2025-05-07T20:11:05.6674970Z libstdc++.so.6 with versions {'GLIBCXX_3.4.20', 'CXXABI_1.3.5', 2025-05-07T20:11:05.6675729Z 'CXXABI_1.3', 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.11', 'GLIBCXX_3.4.9', 2025-05-07T20:11:05.6676186Z 'GLIBCXX_3.4.14', 'CXXABI_1.3.3', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.19', 2025-05-07T20:11:05.6676667Z 'CXXABI_1.3.7', 'CXXABI_1.3.11', 'GLIBCXX_3.4.18', 'GLIBCXX_3.4'}, 2025-05-07T20:11:05.6677103Z libc.so.6 with versions {'GLIBC_2.6', 'GLIBC_2.2.5', 'GLIBC_2.17', 2025-05-07T20:11:05.6677556Z 'GLIBC_2.3.3', 'GLIBC_2.3.2', 'GLIBC_2.3', 'GLIBC_2.14', 'GLIBC_2.7'}, 2025-05-07T20:11:05.6677998Z libpthread.so.0 with versions {'GLIBC_2.2.5', 'GLIBC_2.3.2', 2025-05-07T20:11:05.6678425Z 'GLIBC_2.3.4'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:11:05.6678929Z libcudart.so.12 with versions {'libcudart.so.12'}, libgomp.so.1 with 2025-05-07T20:11:05.6679412Z versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5', 2025-05-07T20:11:05.6679778Z 'GLIBC_2.3.4'} 2025-05-07T20:11:05.6679902Z 2025-05-07T20:11:05.6680125Z This constrains the platform tag to "manylinux_2_27_x86_64". 
In order 2025-05-07T20:11:05.6680792Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:05.6681238Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:05.6681609Z libraries, such as a recent manylinux image. 2025-05-07T20:11:05.7467880Z 2025-05-07T20:11:05.7467900Z 2025-05-07T20:11:05.7468608Z ################################################################################ 2025-05-07T20:11:05.7469684Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:05.7471018Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.7472078Z 2025-05-07T20:11:05.7484465Z -rw-r--r--. 1 root root 505M May 7 20:10 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.7484967Z 2025-05-07T20:11:05.7485100Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:05.7485554Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.7485930Z 2025-05-07T20:11:06.7020013Z a92e213a0fb8a8a20d89b6deb0a6b72a39b19982 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:06.7022000Z 2025-05-07T20:11:06.7022764Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:06.7023995Z 2025-05-07T20:11:08.9192123Z ad9b1c524da90020e23294bb4e307b39abd02afcea9ef0c95f6aa7ee1aefbd9e dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:08.9194360Z 2025-05-07T20:11:08.9195086Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:08.9195965Z 2025-05-07T20:11:09.7683655Z 2ed5311f44880814ad982057e11101ae dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:09.7684168Z 2025-05-07T20:11:09.7684304Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:09.7799357Z ##[group]Run actions/upload-artifact@v4 
2025-05-07T20:11:09.7799713Z with: 2025-05-07T20:11:09.7800014Z name: fbgemm_default_x86_clang_py3.11_cu12.6.3.whl 2025-05-07T20:11:09.7800397Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:09.7800716Z if-no-files-found: error 2025-05-07T20:11:09.7801007Z compression-level: 6 2025-05-07T20:11:09.7801292Z overwrite: false 2025-05-07T20:11:09.7801581Z include-hidden-files: false 2025-05-07T20:11:09.7801860Z env: 2025-05-07T20:11:09.7802127Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:09.7802455Z BUILD_ENV: build_binary 2025-05-07T20:11:09.7802746Z BUILD_TARGET: default 2025-05-07T20:11:09.7803004Z BUILD_VARIANT: cuda 2025-05-07T20:11:09.7803287Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:09.7803566Z ##[endgroup] 2025-05-07T20:11:09.7807313Z ##[command]/usr/bin/docker exec 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:10.2229647Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:10.2230227Z Artifact name is valid! 2025-05-07T20:11:10.2230625Z Root directory input is valid! 
2025-05-07T20:11:10.2980698Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:10.9512250Z Uploaded bytes 8388608 2025-05-07T20:11:11.1604958Z Uploaded bytes 16777216 2025-05-07T20:11:11.5307627Z Uploaded bytes 25165824 2025-05-07T20:11:11.8046824Z Uploaded bytes 33554432 2025-05-07T20:11:12.1354990Z Uploaded bytes 41943040 2025-05-07T20:11:12.4832115Z Uploaded bytes 50331648 2025-05-07T20:11:12.8125426Z Uploaded bytes 58720256 2025-05-07T20:11:13.0236994Z Uploaded bytes 67108864 2025-05-07T20:11:13.4717897Z Uploaded bytes 75497472 2025-05-07T20:11:13.7077452Z Uploaded bytes 83886080 2025-05-07T20:11:14.0900570Z Uploaded bytes 92274688 2025-05-07T20:11:14.3524209Z Uploaded bytes 100663296 2025-05-07T20:11:14.6520359Z Uploaded bytes 109051904 2025-05-07T20:11:14.9204063Z Uploaded bytes 117440512 2025-05-07T20:11:15.2415844Z Uploaded bytes 125829120 2025-05-07T20:11:15.5220916Z Uploaded bytes 134217728 2025-05-07T20:11:15.8955067Z Uploaded bytes 142606336 2025-05-07T20:11:16.2554833Z Uploaded bytes 150994944 2025-05-07T20:11:16.5824788Z Uploaded bytes 159383552 2025-05-07T20:11:16.8183287Z Uploaded bytes 167772160 2025-05-07T20:11:17.1762951Z Uploaded bytes 176160768 2025-05-07T20:11:17.4804235Z Uploaded bytes 184549376 2025-05-07T20:11:17.8760313Z Uploaded bytes 192937984 2025-05-07T20:11:18.1217930Z Uploaded bytes 201326592 2025-05-07T20:11:18.4693721Z Uploaded bytes 209715200 2025-05-07T20:11:18.8167353Z Uploaded bytes 218103808 2025-05-07T20:11:19.0668629Z Uploaded bytes 226492416 2025-05-07T20:11:19.3757029Z Uploaded bytes 234881024 2025-05-07T20:11:19.6843911Z Uploaded bytes 243269632 2025-05-07T20:11:19.9183438Z Uploaded bytes 251658240 2025-05-07T20:11:20.2555388Z Uploaded bytes 260046848 2025-05-07T20:11:20.5268864Z Uploaded bytes 268435456 2025-05-07T20:11:20.7886903Z Uploaded bytes 276824064 2025-05-07T20:11:21.0494315Z Uploaded bytes 285212672 2025-05-07T20:11:21.3670951Z Uploaded bytes 293601280 2025-05-07T20:11:21.6255048Z Uploaded 
bytes 301989888 2025-05-07T20:11:21.9424555Z Uploaded bytes 310378496 2025-05-07T20:11:22.3374704Z Uploaded bytes 318767104 2025-05-07T20:11:22.6021001Z Uploaded bytes 327155712 2025-05-07T20:11:22.8929634Z Uploaded bytes 335544320 2025-05-07T20:11:23.1598243Z Uploaded bytes 343932928 2025-05-07T20:11:23.4638696Z Uploaded bytes 352321536 2025-05-07T20:11:23.7833279Z Uploaded bytes 360710144 2025-05-07T20:11:24.0911379Z Uploaded bytes 369098752 2025-05-07T20:11:24.3997004Z Uploaded bytes 377487360 2025-05-07T20:11:24.6722492Z Uploaded bytes 385875968 2025-05-07T20:11:24.9753686Z Uploaded bytes 394264576 2025-05-07T20:11:25.2922353Z Uploaded bytes 402653184 2025-05-07T20:11:25.6017035Z Uploaded bytes 411041792 2025-05-07T20:11:25.8426669Z Uploaded bytes 419430400 2025-05-07T20:11:26.1752893Z Uploaded bytes 427819008 2025-05-07T20:11:26.4893638Z Uploaded bytes 436207616 2025-05-07T20:11:26.7107628Z Uploaded bytes 444596224 2025-05-07T20:11:26.9877906Z Uploaded bytes 452984832 2025-05-07T20:11:27.3658783Z Uploaded bytes 461373440 2025-05-07T20:11:27.5847069Z Uploaded bytes 469762048 2025-05-07T20:11:27.8781779Z Uploaded bytes 478150656 2025-05-07T20:11:28.2431802Z Uploaded bytes 486539264 2025-05-07T20:11:28.5095323Z Uploaded bytes 494927872 2025-05-07T20:11:28.7541647Z Uploaded bytes 503316480 2025-05-07T20:11:29.0339069Z Uploaded bytes 511705088 2025-05-07T20:11:29.1990798Z Uploaded bytes 518315501 2025-05-07T20:11:29.2215197Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:29.2215933Z SHA256 digest of uploaded artifact zip is edcb4dcdf3c762e1bb0763e1de08e2cc783f95758363c553a883948da412889d 2025-05-07T20:11:29.2216564Z Finalizing artifact upload 2025-05-07T20:11:29.3236043Z Artifact fbgemm_default_x86_clang_py3.11_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081457064 2025-05-07T20:11:29.3238790Z Artifact fbgemm_default_x86_clang_py3.11_cu12.6.3.whl has been successfully uploaded! Final size is 518315501 bytes. 
Artifact ID is 3081457064 2025-05-07T20:11:29.3245765Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081457064 2025-05-07T20:11:29.3500349Z Post job cleanup. 2025-05-07T20:11:29.3505711Z ##[command]/usr/bin/docker exec 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:29.6777965Z [command]/usr/bin/git version 2025-05-07T20:11:29.6812908Z git version 2.47.1 2025-05-07T20:11:29.6843086Z Copying '/github/home/.gitconfig' to '/__w/_temp/5f92e5ed-03b3-4d72-8245-836332e857aa/.gitconfig' 2025-05-07T20:11:29.6851156Z Temporarily overriding HOME='/__w/_temp/5f92e5ed-03b3-4d72-8245-836332e857aa' before making global git config changes 2025-05-07T20:11:29.6852268Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:29.6856835Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:29.6900548Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:29.6929635Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:29.7207892Z Entering 'external/asmjit' 2025-05-07T20:11:29.7260649Z Entering 'external/composable_kernel' 2025-05-07T20:11:29.7315949Z Entering 'external/cpuinfo' 2025-05-07T20:11:29.7392758Z Entering 'external/cutlass' 2025-05-07T20:11:29.7473623Z Entering 'external/googletest' 2025-05-07T20:11:29.7532923Z Entering 'external/hipify_torch' 2025-05-07T20:11:29.7596059Z Entering 'external/json' 2025-05-07T20:11:29.7680159Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:29.7699020Z http.https://github.com/.extraheader 2025-05-07T20:11:29.7705292Z [command]/usr/bin/git config --local --unset-all 
http.https://github.com/.extraheader 2025-05-07T20:11:29.7732149Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:29.8000554Z Entering 'external/asmjit' 2025-05-07T20:11:29.8033180Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8070010Z Entering 'external/composable_kernel' 2025-05-07T20:11:29.8114649Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8156074Z Entering 'external/cpuinfo' 2025-05-07T20:11:29.8190784Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8230922Z Entering 'external/cutlass' 2025-05-07T20:11:29.8276555Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8319718Z Entering 'external/googletest' 2025-05-07T20:11:29.8363233Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8399663Z Entering 'external/hipify_torch' 2025-05-07T20:11:29.8444538Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8481377Z Entering 'external/json' 2025-05-07T20:11:29.8527068Z http.https://github.com/.extraheader 2025-05-07T20:11:29.8711438Z Stop and remove container: db7e8a0d80694b6c947d879fd1fcdfb9_amazonlinux2023_6189ae 2025-05-07T20:11:29.8716731Z ##[command]/usr/bin/docker rm --force 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T20:11:30.6586545Z 5cbac523ab1bac2d9a3da38db9fa5ba5ff830b0fdf4c0802e1a29dc793634db1 2025-05-07T20:11:30.6616953Z Remove container network: github_network_b8a0e08264114d09b83c4f1649c20a21 2025-05-07T20:11:30.6621375Z ##[command]/usr/bin/docker network rm github_network_b8a0e08264114d09b83c4f1649c20a21 2025-05-07T20:11:31.6076024Z github_network_b8a0e08264114d09b83c4f1649c20a21 2025-05-07T20:11:31.6112571Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:31.6134137Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 
2025-05-07T20:11:31.6139988Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:31.6140467Z ##[endgroup] 2025-05-07T20:11:31.6249517Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:11:41.7750786Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:11:58.6141130Z Cleaning up orphan processes